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A FAMILY OF SIMPLE GROUPS ASSOCIATED WITH THE 
SIMPLE LIE ALGEBRA OF TYPE (F,).* 


By Rimuak 


Introduction. In [4] the author constructed a family of new simple 
groups by applying to the Chevalley groups of type (G2) a method which 
was obtained by analyzing Suzuki groups [5]. In this paper, we obtain 
another family of simple groups, which also seem to be new,” by applying 
the same method to the Chevalley groups (Ff,). The orders of the finite 
groups in the family are given by 


where g = 2?"*1, n = 1,2,3,:--. 
The construction is based on a certain automorphism ® of the Chevalley 
group of type (F',) over a field of characteristic 2. The existence of such an 
automorphism is already suggested in [38, Exposé 24, pp. 4-5]. Our group 


turns out to be the group of invariant elements under the automorphism. 
Throughout this paper, we shall use the notation of [4, Section 1]. 


1. The root system of type (F,). The root system & of type (F4) 
consists of the 48 roots: 


(1.1) & &+8 


where: |i], |j|, are distinct; 
&é;, for all 1. 
Let P be the additive group generated by 3, and V=R®P the vector 


* Received September 13, 1960; revised February 21, 1961. 

1 This work was done while the author held a Research Associateship, 1959-1960, 
of the Office of the Naval Research, U. S. Navy. The content of this paper, together 
with that of [4], was announced in a lecture given at the American Mathematical 
Society Summer Institute on Finite Groups, Pasadena, California, on August 22, 1960. 

The author would like to express gratitude to J. Tits for his perusal of the manu- 
script and many valuable suggestions. 

* J. Tits informs the author that he has proved, by treating the orders of the finite 
groups by Artin’s method [6], that the finite groups in our family are actually new. 

* The author learned at Pasadena that the idea of obtaining simple groups by using 
this automorphism was also conceived, independently, by Robert Steinberg. 


401 


— 

| 


402 RIMHAK REE. 


space over the real number field R. Define the positive definite metric (¢ | 7) 
of V by (&|&) =1, (&|&) =0 for i,j =1,2,3,4; ij. Then one can 
verify that the number s(r) defined in [4, Section 1] can be written 
s(r) =2(s|r)/(r|r). Hence for the element w, in the Weyl group W 
of 3, we have w,(s) =s— (2(s|r)/(r|r))r for all s€ 3. From this it 
follows that W leaves the metric (é|7) invariant. By (1.1) one can sce 
easily that (r|1r) =1 or 2, and that (r|s) =0 or +1 for any r,s€ 3. We 
shall write A(r) = (r|r). 
We shall prove a few lemmas which will be used later. 


(1.2) If r,s,r+s€ 3, A(r) =A(s) —2, then A(r+s) 


Proof. 
+ 2(r|s) (s|s)=4+ 2(r|s) =1 or 2. 
Then, since (r|s) 0 or +1, we must have (r|s) =—1, A(r+s) =2. 


(1.3) If r,s,r+s€3, A(r)=—1, A(s)—2, then 2r+s€3, and 
A(r +s) =1, A(2r+s) =—2. 


Proof. A(r+s) =(r+s|r+s) =3+2(r|s) =1 or 2. Since (r|s) 
=0 or +1, we must have (r|s) =—1, A(r+s)=—1. Then A(r—s) 
=3—2(r|s)=—5. Hence r—s¢X. Then s(r) =2(s|r)/(r|r) =—?2 
implies that 27-+s€ 3. We have A(2r +s) =44+2+42(r|s) =2. 


Now, define the linear map y: VV by 
) =& + &2, (é2) = — (és) == + &4, =f, &. 
Then we have (cf. [3], Exposé 24, pp. 4-5) 

(1.4) There exists a permutation r—F of order 2 of & such that 
w(r) =A(r)?, —r—=—F, A(r)A(7) for all 

By the definition of y, one can see easily that y= VY 2ys5, where ys is an 
orthogonal transformation (in fact, a symmetry) of the vector space V. 
From this and (1.4) we have 

(1.5) A(s)(F| 5) =A(F) (7 | 8) for all r,s€ 

(1.6) w,(s) =w,(S) for all r,s € 

Proof. This follows easily from (1.5). 


(1.7) The map w>H#—yowoy is an automorphism of the Weyl 
group W such that w;—w for all r€ 3. 
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Proof. This is immediate from (1.6). 

(1.8) lfir,sr+s€ 3, and %, then A(r) =A(s) = 1,A(r + 8) = 

Proof. Since 7+ 5¢ 3%, we have 5(7) 20, (#|5) 20. Then by (1.5), 
(r|s) 20. Now, (r+s|r+s) =(r|r)+2(r|s) + (s|s)=1 or 2. Hence 
we must have =A(s) —1, (7 | s) =0, A(r+s) =2 

(1.9) Ifrée€ S.A(r) =1, r+ F€ then A(r + 7) —1 and 

Proof. Since =2 by (1.4), A(r+7)— 1 follows from (1.3). 
Then r+ =F+ 


(1.10) The group W? of all we W such that #—w is of order 16, 
and consists of w(e.8,00), w(o.e,8), and w(e,8,7), +1, which are 


defined by: 


e(é, + + 86(é&+&)), 

Es —> dn + &) —8(E + &)), 

dn(e (é, — & ) — 8(& —&,)) 
Proof. Let #=w. Since A(wé;) =1 for all 1, we have wé, = + & or 
Where ¢, = +1. If wé; = + &, then the property (wé, | wé&) =0 for 


i41 shows that all wé; are of the form + é;. Then by using the condition 
yow== wow one deduces easily that w= w(e,8,0) or given above. 
If wé, is of the form $3eié;, the same argument as above shows that all wé; 
have the same form, and from this one can deduce w= w(e, 8,7). 


2. An automorphism of the Chevalley group. The basis elements of 
the simple Lie algebra of type (Fy) given by Cartan [1, p. 224] can be 
modified, by multiplying the coefficients + 1 suitably, so that they satisfy 
the conditions in Theorem 1 of [2, p. 24]. If, however, the ground field K 
is of characteristic 2, then Cartan’s basis elements can be used directly to 
define the Chevalley group G@ of type (Fy) over K (see [4, Section 1] for a 
definition ). 

We shall write 2;(¢), vi;(¢), to denote the elements «,(¢), where 


‘The group W? is actually dihedral. 


6,00): 
w(oo,e, 8): 
w(e, 8.) : 
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r are, respectively, the roots given in (1.1). In this notation, we shall 
write i instead of —i. Thus, for example, 2ij-~(t) denotes 2,(¢), where 
r=4(&+6—&—&). This way of indicating roots which appear as sub- 
scripts will be applied to other symbols also. 

By the multiplication table given by Cartan [1, p. 224] and the formulas 
given by Chevalley [2, p. 26] one obtains the following expressions for the 
commutators (assuming that K is of characteristic 2) : 

(ai(t),«)(w)) =1, 

(a;(t), tv;(u) ) 

(xij (t), = (tu), 
(2.1) (a(t), ) = (tu), 

(xij(t), ) = (tw) (tu’), 

(rina (t), (UW) ) = (tu), 

(rina(t), Viner (U)) =1, 
where (z,y) =2a'y"tay, and where all the other commutators of the form 
(a,(t),2,(u)) are 1. 


It is easy to see that (2.1), above, is equivalent to (2.2)-(2.5), below: 
(2.2) If r,s€3, r+s¢%, then (2,(t),2(u)) =1. 


(2.3) If r,s,r+s€ 3, A(r) =A(s) =1, then (2,(t), x,(u)) = 2,.3(tu) 
or 1, according as A(r +s) —1 or 2. 


(2.4) If r,s,r+s€ 3, A(r) =A(s) =2, then (2,(t), = 7+5(tu). 
(2.5) If res,r+s€3, A(r) =1, A(s) =2, then 
Now we shall prove 
(2.6) Let t— ?#® be an automorphism of the field K. 
Then, in (2.1), we can replace x,(t) by 
E,(t) = a7 (0?) 
without losing the validity of the statements. 


Proof. Denote by (2.n’), 25, the statement obtained from (2.1) 
by the replacement 2,(t) > @,(¢). It suffices to prove that (2. 2’)-(2.5’) are 
valid. 

We shall prove (2.2’). If *4+ 5¢3%, then (2.2’) follows immediately 
from (2.2). If *#+5€ 3%, then by (1.8) (using 7, 5 instead of r, s) we have 
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=A(S) = 1, A(F +5) =2. Hence by (2.3) we have (a7(t),vs(u)) =1 
from which (2.2’) follows at once. 

We shall prove (2.3’). If A(r+s)=—1, then 
—r+s€3. We have A(*7) =A(S) =2. Hence by (2.4), 


(z(t), ) (a7 ag (u?4) ) 


If +s) =2, then F+S5—y(r+s —2(r+s) Hence we obtain 
(7,(t), (w)) = 1 by (2.2). 
We shall prove (2.4’). We have A(r7+s)—2 by (1.2). Hence 


2s—y(r+s) 2(r-+s), Fl ssr+s€3, Then by 
(2.3) we have 

We shall prove (2.5’). By (1.3) we have A(r +s) =1, A(2r-+s) =2. 
Hence 2F+ 25 =y(2r+s) =2(2r+s). Thus Also, 
Hence 

(%,(€), ) (a7 (479), ve (u’)) = (25 (u*), 


Thus the proof of (2.6) is complete. 
Using (2.6), above, and proceeding as in [4], one can prove 


THEOREM 2.7.2 If K is a perfect field of characteristic 2, and if t— t® 
is an automorphism of K, then the Chevalley group G of type (F4) over K 
admits an automorphism x— x such that 


aty(t)? = a7 
forallre Sand te K. We have 
h(x)? =h(Xx). Oy? = 
for all XE X and r€ %, where x is defined by 


X(r) (1) =x 


° After the completion of earlier versions of this paper and [4], the author learned 
from Tits that the existence of the automorphism o was already mentioned in [9], p. 282. 
See also R. Steinberg [8]. 
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3. Invariant elements. From now on we assume that K is a field of 
characteristic 2 and that K admits an automorphism ¢— ?* such that 


(3.1) 267 = 1, 
the right hand side being the identity automorphism.° 


Let G be the Chevalley group of type (F',) over K and «— 2? the auto- 
morphism of G given in Theorem 2.7. An element x€ G will be said to be 
a-invariant if 7 

We shall order the vector space V (and hence 3%) lexicographically with 
respect to the coefficients of &, &, é, Thus => cé > 0 if €~0 ani 
if the first nonzero number of C2, C3, is positive. 

Let 1 (resp. 8) be the subgroup of G generated by 2,(¢) with +> 0 
(resp. with r << 0). Denote by U' (resp. B') the group of o-invariant elements 
in U (resp. in B), and by G* the group generated by 1U' and Bt. Denote by 
' the group of all o-invariant elements in §, the group of all h(x). x€-\. 

First we shall consider 1’. We shall denote the roots in (1.1) by /. ij. 
ijkl, respectively, writing v’ for —i. The set of all positive roots in & is 
decomposed into the following 8 disjoint subsets: 


E, = {2, 12’, 1, 12}, EH, = {14, 1234’}, 
EF, = {4, 34’, 3, 34}, = {14’, 123’4}, 


(3. 2) 7 fe , € 7’ ve 
EB, = {12'34’, 24, 1234,13}, 1284}, 
B, == {12/3'4, 24’, 123'4’, 13’}, Ey = {23’, 12'3'4’}. 


The sets #,, #2, and are of the form {7r,7,r 7, 2r-+ 7}, while 
and EF, are of the form {7,7}, where Since r+ F—2r+i, 
for #,,- - -, #4, one can see easily that, for each H;, the group generated by 
ws(t), s€ Hi, t€ K is mapped onto itself by the automorphism o of G given 
in (2.7). For {7,7}, A(r) =1, an element 2,(t)a;(w) is o-invariant 
if and only if =u’. For #;—{r,7,r+7,2r+ 7}, let 2 be an element in 
the group generated by all 2,(¢), Hi, K. Then 
am (2, te) (ts) ( te) 


by (2.5). Hence, by (3.1), if and only if ¢;—#.9, t, t,%! + 
3 1 2 3 2 4 
Thus, if x is o-invariant, then x is written uniquely in the form 


° It was discovered by Tits that K need not be assumed to be perfect if one considers 
an endomorphism 7, instead of 0, of K such that 7? = 2. For the details, see [10]. 
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where s=7-+ 7, and where t, w€ K. The o-invariant elements obtained from 
each E; in the above manner will be called the o-invariant elements associated 
with 

Now we shall define the following o-invariant elements: ‘ 


a(t) = 24 (t) 23 
a(t) = 2x5 

= 714 (Ft), 


We shall denote by R; the set of roots appearing in the above expression 
of a(€), and by min(R&;) the minimum root in Ff; Thus, for instance, 
R, = {&, é, —&,.é,}, and min(R,) =é,. We can verify from the above that 
(3.3) min(R,) <min(R.) + >< min(R,.). 

We shall prove 

(3.4) If an element 


t= Ty, ) ( tm) 


where all 7; > 0, is o-invariant, then «x can be written uniquely as 


~ 


with u;€ A. For any « such that wu; 0, any root in #; is a linear combination, 


with integral coefficients = 0, of the roots 7, 7m- 


Proof. By (1.2) of [4], we can assume without loss of generality that 
+ and that all {;40. Since is o-invariant, 


v= (7, ) a7, (te) tr (in); 
where, for all 7, 
= 
* For an idea of avoiding the use of the elements a,(¢),° °°, a(t), see [10, section 
2|. The conjecture stated there was proved by Tits himself. 


= 
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Let #; be the minimum root in the set {7,,72,: - -,fm}. Then clearly x can 
be written in the form 
Dz, 2’, 


where x’ is a product of elements 2,(¢) with r>7;. Then by Lemma 6 of 
[2, p. 39] we have 
We can clearly write 


where 2” is a product of the elements z,(¢) such that r is a linear combination, 
with integral coefficients = 0, of the roots 7,72, + *,7m, and such that r > 1,, 
Now (3.5) and (3.6) show that there exists an a,(v) such that «= a,(v)y, 
where y is a product of the elements z,(¢) such that r is a linear combination, 
with integral coefficients = 0, of the roots and such that r > 
Now we can proceed by induction on r;, which is clearly uniquely determined 
by «. If r, is sufficiently large, then (3.4) is clear. Hence by applying the 
induction assumption to y, we have 

where ky <<: <k,. Clearly k< (if 0). Hence we conclude that 
x can be written in the desired form. 

The uniqueness part follows easily from Lemma 6 of [2, p. 39] and (3.3), 
above. The rest of (3.4) is also clear. 


(3.7) For any 1, 7, the commutator (a(t), a;(w)) can be written as 
where min(R#;,) = min(F;) + 

Proof. By (1.2) of [4] one can see easily that the commutator 
(a:(¢),a;(w)) can be written as [[2,(t,), where r= min(F;) + min(/)). 
Hence (3.7) follows immediately from (3.4). 

Denote by ll‘ the group generated by a(t), +, @2(t), CE K. 
Note that, by (3.4), the new definition for 1? is equivalent to the old one 
given at the beginning of this section. Also, set 1*— {1}. Then from 
(3.7) one can derive 


(3.8) Let 1112. If then (2, y) € 1. 


For any element w€ W, define ll, to be the subgroup of 11 generated 
by all z,(#) such that w(r) <0. If we W* (see (1.10)). and if w(r) <0 


all 
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for some € then by (1.7) we have <0 for all r€ In this case, 
we shall write w(R;) <0. We have 


(3.9) If we W?, then every element in U,M WU is written uniquely 
as = (t,) Where unless w(Ri) < 0. 


Proof. This also follows immediately from (3.4). 


Now we shall consider the group ¥'. By considering negative roots 
instead of positive roots, one obtains, as generators of %', the elements 
a’, (t), where a’;(¢) is the element obtained by replacing 
every z, by x, in the expression of «;(¢). Thus, for instance, 


We shall prove 


(3.10) Let #;= {r,7,r+7,2r+7} be any one of £,, H., given 
in (8.2), and let 
aj(t) 
Oy (t) = Loy x(t) 


be the o-invariant elements associated with If then 


(t) = *h (x) wr (8%), 
where X is given by 
X = X7, 
Proof. We have 
(t) = U_(or+F) (t) 
By (6.2) in [4], we have 


(t) = Lor,7 (t-*) h(x:) (E77), 
where 
Xi = Xers7,t7- 
Since, by (2.5), 


(t) (t*) U_ (rs?) 
= (t*) 2, (14) 120-2) 
== a, a7 (82-79) (Xe) wr 


where 


Xo = Xir,t1-20, 


410 RIMHAK REE. 


we have 
a(t) = (t*) (t9*) h (Xe) 
= ay ay (42-79) (Xz © we) oF 
tr (Xs (7) (X; (r + worst 


2U+% 
-(rt+r)< 
-(2%+%) 


FIGURE l. 
We have 
Xi (#7) —=1, +7) and xX,°wr—X, 


since Hence 


= 2, (t9*) (2-9) he (X) oF 


where X is as given in (3.10). From this and (2.5) one can derive the 
desired result easily. Thus (3.10) is proved. 
Set in (38.10). Then h(x)—1. Hence we have G. 


Hence by setting #; = we obtain 
(3. 11) contain the elements W34’W34, Wo4'W 13’. 


Now, any one of Hy, Hs is of the form = {1,7}, where r+ 7 
Hence if )ar(t) (we assume that A(r) =1) are the 
o-invariant elements associated with /;, then 


wrwr = € @. 
Hence, by setting 1; = H;, E;, we obtain 
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It can be easily seen that the elements Wy2, Wo4W13, 
ANA generate the group W? defined 
in (1.10). Hence (3.11) and (3.12) yield 

(3.13) For every w€ W? we can take w(w) in G" so that o(w)? =o(w). 

In what follows we shall take in G* for all we W?, o(1) = 1. 

Now we shall consider the group ' of o-invariant elements in §. As 
in Theorem 4.8 of [4] we have 


h(x)? 
where X is defined by 


X(r) =x(p(r) 
Hence we can deduce easily 
(3.14) h(x) € * if and only if 
X (>) —=X(E1) 4, X (Es) = X(Ea) 


If {r,7,r +7, 2r+ 7} is any one of F., Hs, Hy, then by (3.11) 
we have h(x) € G', where X is given in (3.10). A simple calculation shows 
that if #;— #,, then 


X(E.) = x(Es) 


— 


On the other hand, if #;— F., then 
=X(&) =1, xX(és) = X (és) == 074-2, 
Since ¢ is an arbitrary element in K*, we have by (3.14) 
(3.15) 
Now we shall prove 


(3.16) G* is the group of all o-invariant elements in G. Any element 
is written uniquely as where UW, h(x) € 


Proof. By (1.13) of [4], any w€G@ can be written uniquely as 
r=uh(X)o(w)u’, where U, h(x) € we W, we Uy. Then 


= 


Proceeding as in the proof of (5.16) of [4], we have {(w(w)’) =w# and 
(Ue)? = Uz. Hence wo(w)? =h(xX,)o(@) and uw’ € Therefore, if 27 = x, 
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, 


then by the uniqueness we have =u, X=XX1,, ©=w, 
X,=1, x=x. Now (8.16) follows from (3.18) and (3.15). 


Hence 


4. The center of ll’. As a preparation for the proof of the simplicity 
of G+, we shall prove here that the center of 1 consists of the elements ¢,.(f), 
t€ K. In this section, we do not assume that K has more than 2 elements, 
although we will prove, in the next section, the simplicity of G* under that 
assumption. 

Let the groups U1, U’,---,1° be as in the preceding section. By 
(3.8), 1’ is clearly a normal subgroup of 1. 

All the expressions for the commutators 


(a; (t), aj(w)) == a; (w) 


used throughout this section can be derived easily from (2.2)-(2.5). 
Now, let 


be an element in the center of 11. 


We shall first prove ¢, —¢t,—0. We have 


(4.1) (a(t), =1 (iA1,4) 
Note that (4.1) for i=8 follows from (3.7). Also 

(4.2) (a(t), %0(w)) = a1: (tu). 

(4.3) (a4 (t), = (tu). 


Hence %o(U)*¢a%o(u) is equal to 


Equating this to c, and noting that «,.(¢) is in the center of 1’, we have 
%1,(4,U)a,.(tu) =1. Hence tru—tw—0 for all we K. Hence we obtain 
be — 0. 


We shall prove ¢; 0. We have 
(4. 4) (a,(¢),a.(u)) =1 (1 = 2, 5,6) 
(4.5) (a5 (4), = (tw) (tu?) as (12420) 


By (4.4), (4.5), above, we can easily compare c (mod 11°) and a.(u)-tcx,(w) 
(mod 11°). and obtain fz —0. 
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We shall prove 4,0. By (4.4) and (4.5) we have 
(a(t), %3(w) ) =a; (mod US). 
Combining this with 
(4.6) (s(t), a5(u)) — as (tu) 


we can derive easily t, = 0. 


For the determination of ¢;,t.,- we need 
(4.8) (a(t), a3 ) = (tu?) (79x), 
(4.9) (t+—8,9,10,12), 
(4.10) (a11(t), = Gio (tu). 


By (4.7)-(4.10), above, a;(w)-*ca,(w) is equal to 


O10 (tg) (ty) %10(t10) (t11) G12 (tre). 


Equating this with c and simplifying the relation by 


(a(t), = (A(t), as ) = (H(t), (w) ) 
= (%0(t), as (WU) ) = ) = 1, 


(4.12) (ag(t), a7 (u) ) = 


(4.11) 


we obtain 
+ teu?) + + 


for all w€ K. Hence we have tj; =t,—t,—t,, —0. 


Now, simplifying the relation «;(w)-*ca,;(u) =c by (4.11), (4.12), and 
(%19(t), @-(w)) = (a1 (t), (u)) =1, we obtain —0. 

Similarly, simplifying the relation =c¢ by (a10(t), %6(w) ) 
= =1 and 


(4.13) (a(t), ) = 
we obtain ¢, — 0. 


Finally, simplifying the relation a(w)‘*ca,(u)=c by (4.3) and 
(%1:(t),a,(w)) —1, we obtain ¢;,—0. Thus we have proved 
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THEOREM 4.14. The center of U consists of the elements a,.(t), t€ K, 


5. The simplicity of G'. We shall prove the simplicity of G* under 
the assumption that K contains more than 2 elements. If K contains only 
2 elements, then G* is of order 21*-3°-5?-13 (see Section 6). The author 
does not know the composition factors of this group. 

First we shall prove a lemma which requires the above mentioned 


assumption. 


(5.1) If we W', wA1, then there exists an AE such that 
w (Ww ) h hw ( w). 


Proof. Since w(w)h(x)o(w)* h(x’), where x’ is defined by x’(r) 
=X(w(r)), it suffices to derive a contradiction by assuming that x(wr) 
=xX(r) for all r€ = and all x such that h(x) € §}. 

If w=w(e,8,0) or w(o,¢,8) (see (1.10)), then implies that 
w(é,) =—é, or w(é,) = Then x(é,)?=1 or x(é:) =x(+ 63). This 
is clearly impossible, since, by (3.14), x(é,) and x(&3) can be regarded as 
arbitrary elements in K*, and since K contains more than 2 elements. 

If w—w(e,8,7), then from xX(wé,) =x(é,) and x(wé,) =x(é;) one 
derives easily x(é: + &)*=x(&)xX(és)” which is also clearly impossible by 
the reason stated above. Thus (5.1) is proved. 

Now we shall proceed to prove the simplicity of G. Let HA {1} be a 
normal subgroup of G*. By (3.16), H contains an element «= uho(w) ~1 
with w€ Ut, hE Ht, and we Wt. If then, by (5.1), there exists h, € 
such that o(w)h;Ahy(w). We have 1Azhath,* €WH'N H. Thus we 
have proved that H contains an element c—uhA1, where hE 
Suppose h =h(xX)~1. Then H contains, for any t€ K*, 


Y = = (x(é: + &)—1)t) 


and 


= (t) (t)-* = ao ((X(Es + és) —1) t)y”, 


where y” the group generated by the elements -,@,.(t), K. 
Hence if yy’ =1 for all ¢€ K*, then by (3.4) we would have x(é, + é2) 
= x(é; + €,) =1, and hence by (3.14), X=1, a contradiction. Thus we have 
proved that H contains an element xA1 in UW. Suppose z is not in the 
center of U1’. Then there exists y€ U' such that the commutator (7, y) £1. 
If x€ UW‘, then, by (3.8), (z,y) € U**. Hence H must contain an element 
~1 in the center of 1. By (4.16), H contains «,.(¢t) with some ¢0. 
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Then by considering h(X)a12(t)4(xX)-* we see easily that H contains a,2(¢) 
for all ¢€ K. 

Since G* contains wp) = o(w(—1,—1,0)) by (3.13), H also contains 
x',»(t) =o%2(t)wo for all t€ K. Now, setting r—é, in (3.10), we have 
4,—=%,.. Hence by (3.10), H contains h(X)212, where x is defined in 
(3.10) with If ¢=1, then h(x) =1. Hence H contains 
and h(x) for all t€ K*. We have 


(5. 2) X = x (Es) = = 1. 
Hence. for any w€ K and t€ K*, H contains 


a; (U)h(X) as (w) 
= a5 (u + ( 4 ) 


Therefore H contains a;(u+?4u) for all we K and te K*. Since 
| + 1&4 —0 implies t= 1, and since K has more than 2 elements, it follows 
that H contains «;(/) for all #€ K. Thus we have shown that H contains, 
for all ¢ and w in K, the elements a;(¢)a,.(w), which are precisely the 
g-invariant elements associated with the set H, given in (3.2). 

Since w(oo,1,1), w(1,1,1), w(1,—1,1) map onto F,, respec- 
tively, it follows from (1.6) of [4] and (3.13) that H contains all the o- 
invariant elements associated with #., also. 

Now we shall turn to the o-invariant elements associated with the sets 
I’... > +, B.. We have already seen in the above that H contains h(x) with x 
defined in (5.2). Hence H contains, for any wé€ K and t¢€ k*, 


Since 1 + ¢°=0 implies {1 and since K contains more than 2 elements, 
it follows from the above that H contains 2,)(¢) for all f¢€ K. Now, ao(¢), 
‘€ K. are exactly the o-invariant elements associated with the set #;. Since 
w(1, —1,0), w(o,1,1), w(—1,1,0) map EF; onto Es, respectively, 
we conclude, as above, that H contains all the a-invariant elements associated 
with the sets Fy, H;, and Ex. 

Thus H contains all the o-invariant elements associated with H,,: - -, Es. 
This implies that H contains +, %.2(¢), for all K, since any 
is associated with some E;. Since G* contains wp) =w(w(—1,—1,0)), it 
follows that H contains @’;(t) %(t)wo' for all i=1,- - -.12, and K. 
Therefore we have H =G*. Thus we have proved 
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THEOREM 5.3. If K possesses more than 2 elements, then the group (7 
as simple. 


6. The finite groups. Let K be a finite field of g—2?"" elements, 
where n=0,1,2,---. Then the automorphism defined by 
where m = 2", satisfies 2071. Hence the group G* can be defined over K. 
We shall compute the order of G*. By (3.16), G* can be partitioned into the 
16 subsets: 

(Uy N U*), 


where w runs over the 16 elements of W*. The order of WU is, by (3.4), 
q'*, and the order of §' is, by (3.14), (g—1)*. The order of U, W is, 


by (3.9), gq, where n(w) denotes the number of indices i—1,- - -,12, such 
that w(R;) <0. Hence the order of G* is 
(6.1) 


where the sum runs over all w€ W?. 


We shall compute the numbers n(w), w€ W*. Note first that if a;(¢) 
and «a;(¢) are associated with the same set E;, then w(H,) <0 if and only 
if w(H;) <0. In fact, it can be easily seen that, if w(7r) <0 for some r€ EF, 
then w(r) <0 for all re H,. For wé W', define v(w;H,) =1 or —1 
according as w(r) > 0 for all re EF; or not. Then we have 


(6.2) n(w) (1—v(w;Fy)) + (1 —v(w;E;,)). 


The numbers »(w;/;,) can be computed easily by (1.10) and (3.2). We 
have 
The table of v(w; E;) 


w (oo, «€,8) € 8 5 — 
we, 6,1) € € € € € € 
w(e,d8,—1)|« —e € € € € 


Hence by (6.2) we have 


n(w(c«, 8,00) ) = 6 —5e—8=0, 2, 10, 12, 
n(w(o,«,8)) «—d—4,6,6, 8, 
1,8,9, 11, 
n(w(e,8,—1)) = 6 — 2e —8 =3, 5, 7, 9. 
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Therefore > g”"™ is equal to 
=(¢+1) (+1) (9 +1). 


Hence by (6.1) we have 


THEOREM 6.3. Jf K ws a finite field of qg=2°"*' elements, where 
n=0,1,2,:--, then the order of G* is given by 


q*(q—1)(¢ + 1)(¢*—1) (¢° +1). 


7. Automorphisms of G'. If t—/? is an automorphism of the field 
kK which commutes with 6, then there exists an automorphism p of G* such 
that a(t) > a(t’), a(t) for all ¢€ K and We shall call p the 
automorphism of G* induced by the automorphism t—> ?’ of K. We shall prove 


THrEoREM 7.1. If K is a finite® field possessing more than 2 elements, 
then every automorphism p of the group G* can be written uniquely in the 
form p=pips, where p, ts an automorphism induced by an automorphism 
of K, and where pz is an inner automorphism of G'. 


Proof. Arguing as in the proof of Theorem 9.1 of [4], we see that 


we can assume (7.2)-(7.5), below: 


(7.2) =, 

(7.3) for all K, 

(7.4) h(x)? =h(x) for all h(x) € §', 

(7.5) Wo? = h (Xo) wo where w) =w(w(—1,—1,0)). 


Define the roots 11,72,° *,?12 by setting if a(t) 
or if a(t) a7 (t)2,,7(t%1). Then the roots r1,- -,712 are respectively, 


34’, 34, 28’, 24’, 12’, 24, 23, 13’, 14’, 14, 13, 12, 


Where ij and ij’ denote &;+ é and é;—&;, respectively. From the definition 


of r; we have 


‘J. Tits remarks that the finiteness assumption seems to be dispensed with if one 
uses the geometric interpretation of the group @ given in [10]. 
grouy § 
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for all h(x) € $1 and t€ K. Now fix an 1, and set 

Then by (7.4) and (7.6) we have 

(7.7) (x (1) 


Hence if for the root 7;, there exists h(x) € $1 such that x(7;) =1 and 
x(rj;) ~€1, then we have ¢9)—0 for all ¢¢ K. Using this, (3.13), and the 
assumption that K has more than 2 elements, we can deduce (7%.8)-(7.10), 
below: 

(7.8) If i 1,2,5, then a;(¢)? —a;(ct), where c; is a constant in 

(7.9) Both a,(¢)? and a,(¢)? are of the form @,(¢,)a.(t2). 

(7.10) is of the form 

It can be verified easily by (2.1) that the center of the group generated 
by the elements @,(t), a(t), ¢€ K, consists of the elements a(t), K. 
From this, (7.7), and (7.8) that there exists a constant c. in K such that 
= for all t€ K. Now take an element h(x) € such that 
x(3 + = and define an inner automorphism p’ of G* by x? = h(x)rh(x). 
Then a(t)" a(t) for all t€ K, and all the statements (7. 2)-(7.10) 


remain valid when p is replaced by pp’. Hence, for the proof of Theorem 7.1, 
we can furthermore assume 


(7.11) Ao == 
By (7.7%) and (7.9) we can set 
@, == a, 
where c, and c’ are constants in K. Combining this with the identity 
(t)a,(w) =a,(t + 


we obtain 


for all tj K. By setting t= u—1 we have 1, c, Then from 
the above it follows that c’(¢?% + tu*?) =0 for all ¢, w€ K. From this and 
the assumption that K has more than 2 elements we can derive easily c’ = 0. 
Thus for all ¢¢ K. Similarly, we can prove a,(t)* = 
for all ¢€ K. Thus 


(7.12) a,(t)?’—a,(t), a;(t)?==a;(t). 


A FAMILY OF SIMPLE GROUPS. 419 


Now, applying p to the identity (4.7) and using (7.8), (7.11), and 
(7.12), we obtain Cy Similarly from (4.4) 
we obtain ¢,—1. Thus we have proved c;—1 for all 1. Thus 


(7.13) a(t)? = a;(t) (¢—=1,2,- -,12) 
By setting r=, é, in (3.10) we obtain, respectively, 


(t) == a5 (4-9) (4-79), 
(UW) == (002-20) -2 cx, (2-26), 


where: =o(w(—1,1,0)); o”’ =w(w(1,—1,0)); h’ is an element in 
$! which commutes with w” and the elements a,(t), (€ K; h” is an element 
in § which commutes with o’ and the elements a;(¢), ¢€K. Since 
w(—1.1,0)w(1,—1,0) =w(—1,—1,0), we can assume w/w” 
=w(w(—1,—1,0)). Then, since (a,(¢),¢;(w)) 1, we have from the 
above. 


with h=h’h” € §'. We have from (7.5) 


= (wo % (UW) wo)? 
= (Xo (Es + + 
= (Xo) wot 


where = From this and the relations 

woh (Xo) = h(Xo)-, 13H! Bt {1}, 
we can deduce easily h(Xo) =1. Hence (7.5) vields 
(7.14) Wo? = wo 


Since the group G' is generated by wo and @,(t),: - -,a@.(t), t€ K, we can 
conclude from (7.13) and (7.14) that p—1. Thus Theorem 7.1 is proved 
except for the uniqueness part. 

The uniqueness part, however, can be proved just as the corresponding 
part of Theorem 9.1 of [4], if one uses the fact that w) belongs to the group 
generated by the elements a,(1), %2(1), %5(1), #’1(1), @2(1), @’5(1), 
Thus the proof of Theorem is complete. 


UNIVERSITY OF BRITISH COLUMBIA, VANCOUVER, B. C. 


] 
| 


420 RIMHAK REE. 


REFERENCES. 


[1] E. Cartan, “ Oeuvres completes,” Part 1, vol. 1, Gauthier-Villars, Paris, 1952 

[2] C. Chevalley, “Sur certains groupes simples,” Téhoku Mathematical Journal (2), 
vol. 7 (1955), pp. 14-66. 

[3] Séminaire C. Chevalley, “Classification des groupes de Lie algébriques,”’ vol. 2, 
Paris, 1956-58. 

[4] Rimak Ree, “A family of simple groups associated with the simple Lie algebra of 
type (G.),” American Journal of Mathematics, vol. 83 (1961), pp. 432-462. 

[5] Michio Suzuki, “A new type of simple groups of finite order,” Proceedings of the 
National Academy of Sciences, vol. 46 (1960), pp. 868-870. 

[6] E. Artin, “ The orders of the classical simple groups,” Communications on Pure 
and Applied Mathematics, vol. 8 (1955), pp. 455-472. 

[7] R. Steinberg, “ Variations on a theme of Chevalley,” Pacific Journal of Mathe- 

matics, vol. 9 (1959), pp. 875-891. 

, “Automorphisms of finite linear groups,” Canadian Journal of Mathe- 

matics, vol. 12 (1960), pp. 606-615. 

[9] J. Tits, “Sur les analogues algébriques des groupes semi-simples complexes,” 
Colloque @algébre supérieure, Bruxelles, 1956, pp. 261-289. 


[8] 


[10] , “ Les groupes simples de Suzuki et de Ree,” Séminaire Bourbaki, December, 


1960. 


THE STRUCTURE OF FROBENIUS ALGEBRAIC GROUPS.* 


By Davin 


1. Introduction. A finite group @ is called a Frobenius group if it 
has a proper subgroup H which is “disjoint” from all of its conjugates 
(i.e. tHa* HSA {e} implies H). The set M—{G—LU {e} 


is then easily seen to be a normal subgroup, called the regular subgroup of G. 
The structure of M/ has been greatly studied and a long outstanding conjecture 
that M is necessarily nilpotent has recently been proved by J. Thompson [4]. 

In this paper we study the structure of Frobenius algebraic groups. 
An algebraic group @ is called a Frobenius algebraic group if it has a proper 
algebraic subgroup H such that for all ce G, tHa*N H ¥ {e} implies ve H. 
Denote M = {G— |) eHx"} U {e}. This notation will be kept throughout 


the paper. 


The simple counting techniques of the finite theory are no longer avail- 
able and consequently the nature of the study of Frobenius algebraic groups 
is necessarily quite different from that of the finite theory. For example, the 
fact that WV is a subgroup does not come out until quite late (Theorem 4). 
In the finite case this is proved by Frobenius using the theory of characters. 
For properties of finite Frobenius groups see [3]. 

The theory of algebraic groups is assumed. The reader is referred to 
[1] and [2] for the standard theorems and terminology. In particular, if 
(; is an algebraic group, G» denotes the connected component of the identity 
element e; G,) is a normal algebraic subgroup of finite index in G. A Borel 
subgroup of G@ is a maximal connected solvable algebraic subgroup. <A torus 
of G is a connected diagonalizable subgroup. The set of semisimple (i.e. 
diagonalizable) elements of G is denoted G,; the set of unipotent elements of 
(r is denoted G,. A unipotent group, i.e., one for which G = G,, is nilpotent. 
If a Borel subgroup of @ is nilpotent then @ is nilpotent; in particular if a 
maximal torus of G is reduced to {e} G@ is nilpotent. In general p will denote 
the. characteristic of the ground field. 

The author wishes to express his gratitude to Professor I. N. Herstein 
lor suggesting these investigations as well as for many stimulating and helpful 
conversations. 
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2. Basic properties of Frobenius groups. For g¢€G@ we define the 
rational map o,: G>G@ by o,(z) If N is a subgroup of 
normalized by g then clearly o,(N) C N, and if N is a normal subgroup of (, 
a, induces a map (still denoted o,: G/N. 


Lemma 1. Let G be a Frobenius algebraic group and N D N, algebraic 
subgroups of G with N, normal in N and such that H normalizes N and N,, 
Then 


a) forallhe H, hfe, op ts injective on N tf and only if NN H = {e}: 


b) if on is injective on N and surjective on N, then op ts injective 
7 
on N/N,; 


c) if op is surjective on N, and on N/N, then op is surjective on NV. 


Proof. a) Letne NOH; then on(n) =e —on(e), hence n=e. Con- 


versely implies nhn-* = n’hn’* and so HS {e}. 
Therefore n-'n’€ HM N = {e} and n—n’. 


b) Suppose o,(n) =o,(n’)n, with n, n’€ N,ny€ Ny. Then n’n,n’€ JV, 
and so for some n’,€ N,. Therefore o,(n) =o,(n’)ni, 
= =on(n'n’,). By hypothesis o, is injective on N, so 

=n’n’, and is injective on N/N,. 


c) Let née N; then op surjective on N/N, implies n ~o;(n’)n, with 
n’E N, n,€N,. As in the proof of (b) there exists n’,€ N, such that 
n =o7,(n'n’,). q.e.d. 


LemMMA 2. Let G be a Frobenius algebraic group and N a solvable 
algebraic subgroup of G normalized by H. For h€ H, if oy is injective on 
N then op is surjective on N. 


Proof. First suppose NV connected. If N is commutative o, is a homo- 
morphism of NV into N and since o; is injective o,(N) and N have the same 
dimension, hence o,(NV) WN. We proceed by induction on the dimension 
of VN. Let N, be the commutator subgroup of NV; since N is solvable N, is 
of strictly smaller dimension and by the induction hypothesis o;, is surjective 
on N,. By Lemma 1b oy, is injective on N/N,, which is commutative, hence 
on is surjective on N/N,. Therefore by Lemma 1c oy is surjective on NV. 

In the general case, by what we have already proved oy is surjective on 
N, so by Lemma 1b oz is injective on N/N>, which is finite. Therefore o; 
is surjective on N/N, and applying Lemma 1c we have o; surjective on V. 


q.e. d. 
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Lemma 3. Let G be a Frobenius algebraic group and N a solvable 
normal algebraic subgroup of G such that NN. H = {e}. Let p be a rational 
representation of G with kernel N, G=p(@), H=p(H). Then G is a 
Frobenius algebraic group. 


Proof. Let G such that HA {e}. Then if there 
exist, h, hy € Hy hh Ae, hy e, and n€ N such that By Lemma 
1a a, is injective on N hence by Lemma 2 oz is surjective on N ; let n = o,(n,). 
Then = = (m1) and so ny n,n HFS {e}; 


therefore n, a € H and € H. q.e. d. 


Proposition 1. Let G be a Frobenius algebraic group. Then HO Gy 
=H, and if H is not finite, H) =T is a maximal torus of G, Go is solvable, 
and G=H-M. 


Proof. Suppose H is finite and let K =H G); if s€ Ky, se, then 
s belongs to a maximal torus 7 of G. If x€ G) centralizes T, rsx 
=s€ HM cHe"' and c€ K so that TCK and T = {e} which is impossible. 
Therefore K = K,; let we K, ue. Then we B, for some Borel subgroup 
B of G and since B, is unipotent it is nilpotent and consequently has a 
positive dimensional center Z. If «€ Z, HM hence kK, 
therefore ZC K which is a contradiction. Therefore K =H 1 G, = {e} = Hp. 

We suppose then H is not finite, i.e., Hp ~ {e}. If x€ G normalizes H, 
then «€ H and so Hy is equal to its connected normalizer in G. If H, is 
unipotent it is nilpotent, hence is a Cartan subgroup of G) and G)—=H, so 
that G normalizes Ho, hence G/T contrary to assumption that H is a 
proper subgroup of G. Therefore a maximal torus S of Hy is ~ {e} and 
since 8 is contained in a maximal torus 7’ of G, and T centralizes S, TCH 
and T= 8. 

Let B, be a Borel subgroup of //) containing 7’ and B a Borel subgroup 
of G, containing B, Then BN H, is a solvable subgroup of H, con- 
taining B,, which is maximal solvable, hence B; = BO H,. Similarly, since 
(BN H) oC Ho, Bx =BN Therefore By, =ByO 

Let x€ B, normalize then if By, {e}, rHx H A {e} and I, 
so that B,, is equal to its connected normalizer in B,. Since B, is a connected 
nilpotent group, it follows that B,—B,, and since B=T7-B,, B=B,CH. 
Let g€ Go; then gBg" is a Borel subgroup of (@, and since any two Borel 
subgroups contain a common maximal torus, gH¥g?#M HA {e} and ge H, 
le. Hence Go =H, and G=H contrary to hypothesis. 

Therefore B,, = {e} and B, = T so that H, =T. Let K=HNG, and 
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suppose u€ K,, ue. Then we B, for some Borel subgroup B of G and 
since B,~{e} the connected center Z of ByA{e}. Let Z; since 
AN H, i.e. ZCH, hence ZCH,)—T which is impos- 
sible since Z is unipotent. Therefore K,—{e} and K—Kz,. Let s€ K,, 
se; then s is in some maximal torus g7Tg"! of G and so se HN gHg". 
Therefore g€ H and gTg*=T, i.e. s¢ Hy and K=HNG,=—H,. More- 
over Ng(H,) CH and so Neg,(H.) Go=Ho, hence Ne,(T) =T and 
the Weyl group of G, is trivial, i.e. Go is solvable. 

Let x€ G; «Tx is a maximal torus of G, hence eTx*=yTy™ with 
y€ Go, and ytx€ H so that Since Gy.NHCGNH=H =T, 
Go. H = {e} and 

By Lemma 1a, for all he H, he, op is injective on Gp, and so by 
Lemma 2, op is surjective on Go,. Let g€ M, g=hu, Go, hE H. 
If h Ae, u=on-2(go) and g —gohgo* which is impossible. Therefore h =e 
and M CG@ oy. Conversely, since Go. H = {e}, Gon U = {e} and 
GouCM, hence Go, = M. q.e. d. 

Proposition 1 gives the desired structure theorem for Frobenius algebraic 
groups in the case H not finite. For the remainder of this paper we adopt 
the convention: an algebraic group ( is a Frobenius algebraic group if it has 
a proper finite algebraic subgroup H such that for all x€ G, rHtx* N HF {e} 
implies x€ H. Proposition 1 then asserts HN Go = {e}. 


CoroLuary. If G is a Frobenius algebraic group then 


Proof. By Proposition 1 G) H = {e} and since G@, is normal in G, 


Therefore G,C tHx"} U {e} = M. 
q.e. d. 


3. A first structure theorem. The corollary to Proposition 1 asserts 
GoCM hence if M is a subgroup of G (as is proved later in Theorem 4) 
G)o—=M,. Therefore a first approach to the study of the structure of M is 
through the structure of G@). In this section we consider the special case 
where G, is assumed to be solvable. 


THEOREM 1. Let G be a Frobenius algebraic group and suppose (Cy 
is solvable. Then Gy ts nilpotent. Moreover, if dim G > 0 either Gy is a torus 
of dimension d=0 (mod p—1), in which case H 1s a p-group, or Go 1s 
unipotent, in which case the order of H is prime to p. 


Proof. Let 1 be a prime dividing the order of H and let h be an element 
of H of order 1. If dimG,>0 G, contains a characteristic subgroup A 
isomorphic to G, (the one-dimensional additive group). Since hAh*=4A 
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and no element of A centralizes h the map adh: A—A defined by adh(a) 
— hah induces a fixed-point-free automorphism ® of G,. Any automorphism 
of G,_ is of the form +— Az?” and since @! = 1 we must have (7) = Aw where 
and A~1, hence 1Ap. Therefore if dimG, > 0 the order of H is 
prime to p. 

Now let 7, be a maximal torus of Gy; then AT,h is also a maximal 
torus of Go, hence by conjugacy of maxima! tori hT,h"* aT yr for some 
v€ G@. By Proposition 1 and Lemma la, op is injective on G, and by 
Lemma 2, o, is surjective on Gy) so we have ton(g), g€ G. Therefore 
hT = o0n(9) Toon(g)? = oghgth+, i.e. if T= og, =T. 
If d=dimT and n is an integer prime to p the subgroup 7, of elements of 
T of order n contains exactly n* elements. Clearly h7,h-*—T, and no 
element of T centralizes H. It follows that n4==1(modl). Therefore if 
~ p, in particular 14=1 (modl) and so d=0, T = {e} and Gy, is unipotent. 
If H is a p-group, Go is a torus and n¢==1 (mod p) for all n prime to p 
implies d==0 (mod p—1). q.e. d. 


4, Fixed-point free automorphisms. An automorphism ® of a group 
(; is said to be fixed-point-free if the only element of @ left fixed by ® is 
the identity element. If G is a Frobenius algebraic group and he H, he, 
then (assuming M to be a subgroup) adh: e—hxh- induces a fixed-point- 
free automorphism of M/. Conversely, if an algebraic group M admits a 
rational fixed-point-free automorphism ® of prime order then, if H denotes 
the group generated by ®, the split extension of H by M is a Frobenius 
algebraic group. The following statements are translations of the corres- 
ponding results on Frobenius groups into the language of automorphisms. 
The proofs of Lemmas 1’, 2’ and 3’ are identical to the proofs of the corres- 
ponding results on Frobenius groups; it is clear that here we need no restric- 
tion on the order of ®. If ® is an automorphism of G, we denote ® the map 
of @ into defined by 

Lemma 1’. Let G be an algebraic group, ® a rational automorphism 
of G and ND N, algebraic subgroups of G, stable under ®, with N, normal 
N. Then 


a ® is injective on N if and only if ® is fixed-point-free on N; 
3 


b) if & is injective on N and surjective on N, then ® is injective on 
N/N,, i.e. if ® acts fixed-point freely on N and ® is surjective on N, then 
® induces a fixed-point-free automorphism of N/N,; 


c) if & is surjective on N, and on N/N, then ® is surjective on N. 
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LemMA 2’. Let & be an automorphism of the algebraic group G and N 
a solvable algebraic subgroup of G stable under © and such that ® acts fired- 
point freely on N. Then ® is surjective on N. 

Lemma 3’. Let G be an algebraic group admitting a fixed-point-free 
rational automorphism ® and N a solvable normal algebraic subgroup of ( 
stable under ®. Then ® induces a fixed-point-free automorphism of G/N. 


Lemma 4. Let G be a connected algebraic group admitting a fixed-poin!- 
free rational automorphism of finite period n. Then for all xe G, 

- -O(z)r=—e. 

Proof. For z,y€G we define yor=@(y)zry*. Then ani 
z0 (yor) = (zy)°ox so that G acts on itself as a transformation space. 
The isotropy subgroup of e consists precisely of the fixed-points of ®, hence 
is reduced to {e}. Therefore the orbit U of e is an open subset of G (in fact, 
U is a principal homogeneous space for G). If g«€ U we may write g = ®(y)y"', 
hence ®*-1(g)®"*(g)- - -®(g)g =e for all g€ U. Since U is dense in (¢/ 
the assertion of the lemma follows immediately. q.e. d. 


Lemma 5. Let ® be a fixed-point-free rational automorphism of « 
torus T. Then if the order of ® is finite, it 1s a power of p and if s is prime 
to p, &* is also fixed-point-free. 

Proof. Suppose ”=1 and let / be a prime different from p dividing 
n, say n=Im. Let {xe (x) =2}; then F, is a torus admitting 
a fixed-point-free automorphism of period 1p and so by Theorem 1 F is 
finite of order=1(modl). Let 7,=—T/F and T,,6"(z) =i}. 
If <€ T, is such that € F, then ©" - - 
= and so [Z16"(Z) —@, i.e. € F and since 
the order of F is =1 (mod/), 2*@"(x) € F, hence € F;. Therefore &” acts 
fixed-point freely on the torus 7,/F, so by Theorem 1 T, =F, i.e. 6" = 1 
on Therefore the homomorphism p: maps T into F and 
the image of p has dimension 0. Therefore T —kernel of p, that is to say 
©” —1o0n 7. Proceeding in this manner we finally have ©?” —1 on T for 
some power r of p. 

If (s,p) =1 then as+ dp"=1 and Therefore if 


we have r= = (zr) so r—e. q.e. d. 


Lemma 6. Let ® be a fixed-point-free rational automorphism of a uni- 
potent group G. Then ©?" also is fixed-point-free. 


Proof. Let F= {x€ G,&?"(x) =x}. Since G is unipotent, hence nil- 
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potent, so is the subgroup F and if = {e} then there exists an element 
re in the center of F. The group H generated by z,®(z),: (zr) 
is a finitely generated commutative group, hence a finite p-group. But H 
admits the fixed-point-free automorphism ® of period a power of p, hence 
must have order ==1 (mod p), which is impossible. Therefore = {e} and 
©?" is fixed-point-free. q.e. d. 


TurorEM 1’. Let G be a connected solvable algebraic group admitting 
a fived-point-free rational automorphism ©® of order n. Then tf n ts prime 
to p, G is unipotent and if n is a power of p, G is a torus. In particular, 
if n is a power of a prime G 1s nilpotent. Also, if the restriction of ® to Gy 
has order prime to p, G ts nilpotent. 


Proof. Let T, be a maximal torus of @; then ©(7,) =aT,a" and 
writing x=®(y)y* (Lemma 2’) the torus 7—y"T,y is stable under ©. 
Since G = 7'- G, the first assertion of the theorem is an immediate consequence 
of Lemmas 5 and 6. 

Now suppose the restriction of @ to G, has order s prime to p. For 
te T, we Gy, 8(tut-1) and so tut-? = 
i.e, (4@8(¢) centralizes G, for all ¢€ T. By Lemma 5, © is fixed-point-free 
on T and by Lemma 2’ every element of T is of the form ¢-*@*(¢). Therefore 
T is in the center of G and G=T X G, is nilpotent. q.e. d. 

We have seen in the proof of Theorem 1’ that there is a maximal torus 
T of G stable under © and by Lemma 5 the restriction of @ to T has order 
a power of p. If m is any integer prime to p the subgroup 7’ of elements 
of T of order m has order m* where ddim 7—rank of G. Since T 
admits ® fixed-point freely m4==1 (modp). Therefore we must have d=0 
(mid p—1). 


Corotiary. If the algebraic group G admits a fixed-point-free rational 
automorphism of finite order then G is solvable of rank d==0 (mod p—1). 


Proposition 2. Let G be an algebraic group admitting a fixed-point-free 
rational automorphism ® of period 2. Then G is commutative and ©® is 
inversion. Moreover, if p= 2, G is a torus and if p2, Gy is unipotent. 


Proof. By Lemma 4 for all x€ Go, =e, hence = 2" and G, 
is commutative. Applying Lemmas 2’ and 1’ we have @ is injective on G/G,, 
hence surjective on G/G, since G/G, is finite. Then by Lemma 1’c, ® is 
surjective on G and therefore for all x€ G, (1) x =e, i.e. ® is inversion and 
( is commutative. 

Since ® acts fixed-point freely, G has no elements of order 2 and so 
if p= 2, G, is a torus and if p2, is unipotent. q.e. d. 
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Lemma 7%. Let G bea connected algebraic group admitting a fixed-point- 
free rational automorphism ® of period 3. Then G is nilpotent. 


Proof. By Lemma 4 for all x€ G, 6?(x)®(x)a—e and hence also 
©? =e, i.e. =e. Therefore = 26(z) 
= (z)z, i.e. for all e€ G, x commutes with @(x). Let K be the subgroup 
of G generated by x and ®(x); K is a commutative group stable under 6 
and so by Lemma 2’, & is surjective on K. Hence © is surjective on G. 

Let B, be a Borel subgroup of G; then ®(B,) is also a Borel subgroup, 
hence by conjugacy 6(B,) Since is surjective, for 
some y€ G. Then it follows that B= y“*Bvoy is a Borel subgroup of G stable 
under 6. By Theorem 1’, B is nilpotent and hence G = B is nilpotent. 

q.e. d. 


THEOREM 2. Let G be a semi-simple algebraic group and ® a rational 
automorphism of G. Then ® has a non-trivial fixed-point. 


Proof. We assume ® acts fixed-point freely on G and arrive at a con- 
tradiction. The automorphisms of the semi-simple algebraic groups are known 
to be of the form ®(2) =az%a-' where o is an “outer” automorphism and is 
of finite order, o*" 1. When G is simple and r > 1 we may have either r= 2 
(type An, Dn, He) or r=3 (type D,). By repeated application of & we have 
(x) = where b = - -®(a)a—aa’: - -a%"", hence b% = 
Then ©(b) b and so b =e and 

Let G’ be one of the simple components of G; since the simple components 
of G are uniquely determined, 6‘(G’) is a simple component of G for each i, 
0Si<r. They cannot all be distinct for otherwise if 2€ G@’ the represen- 
tation -@(x)x=e given by Lemma 4 would imply ce. There- 
fore for some 1, Let d=(t,r) and a+br=d; 
then hence 64(G’) =G’. Choose d|r such that ©4(G’) = @’ and 
Gi’, ®(G’),- are all distinct. Let x€ G’ be such that 64(z) =2; 
then if y= - -@(x)a,@(y) =y and so ye which implies 
Therefore 4 is a fixed-point-free automorphism of G’ of order r/d. Replacing 
G by G’ and ® by ¢ we have reduced our considerations to the case of a 
simple group G. But in this case r= 2 or 3 and then Proposition 2 and 
Lemma 7 imply G must be nilpotent, which is impossible since G is simple. 

q.e. d. 

THEOREM 3’. Let G be a connected algebraic group admitting a fixed- 
pownt-free rational automorphism ®. Then G is solvable. Moreover, if ® is 
of finite order n prime to p, G is unipotent and if & is of order a power of p, 
G is a torus. 


428 


STRUCTURE OF FROBENIUS ALGEBRAIC GROUPS. 429 


Proof. Let R be the radical of G; R is a connected solvable characteristic 
algebraic subgroup of G. By Lemma 3’ the semi-simple algebraic group G/R 
admits a fixed-point-free automorphism so by Theorem 2, G=— FR is solvable. 
The remaining assertion of the theorem now follows immediately from 
Theorem 1’. q.e. d. 


5. A second structure theorem. Our first structure theorem (Theorem 
1) asserts that if @G is a Frobenius algebraic group such that G, is solvable 
then G, is nilpotent. The results of the preceding section allow us to remove 
the assumption of solvability. 


THEOREM 3. Let G be a Frobenius algebraic group. Then Go is nil- 
potent. Moreover, either Go is a torus of dimension d==0 (mod p—1) and 
H is a p-group, or Gy 1s unipotent and the order of H 1s prime to p. 


Proof. Let 1 be a prime dividing the order of H and let h be an element 
of H of order 1. Since hG,h-1—G, and no element of G, centralizes h the 
map adh: Go—G, defined by adh(g) =hgh™ is a fixed-point-free auto- 
morphism of G, of period 1. By Theorem 3’, G, is nilpotent and the assertion 
of the theorem is an immediate consequence of Theorem 1. q.e. d. 


6. The regular subgroup of a Frobenius algebraic group. Recall that 
if G is a Frobenius group we denote M = {G— |) rHz"} U {e} and a theorem 


of Frobenius asserts M is a normal subgroup of G in the case G finite. We 
are now in a position to extend this theorem to algebraic groups. 


THEOREM 4. If G is a Frobenius algebraic group then M is a normal 
algebraic subgroup, called the regular subgroup of G. 


Proof. Let p be a rational representation of G with kernel Go, G= p(@), 
H=p(H). Since G, is nilpotent (Theorem 3) Proposition 1 and Lemma 3 
imply G is a Frobenius algebraic group. Since G is finite, M is a normal 
subgroup of G. Let g€ G, g¢ M; then there exists hE H, he, and ré G 
such that and so g=p(g) € Moreover, since H 

= {e}, GN U = {e}, hence g¢ Go, i.e. G4 Therefore g¢ M and 

Suppose now g€ M; if @, then g—=p(g) M. If g¢G, and 

if g€ there exists G and he H, such that ha gre G. 
By Lemmas 1a and 2, 0, is surjective on G, and there exists go€ Gp such 


that ha-*gr = o4(go) =hgohgo, i.e. g =agohgor-? € tHa which is 
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impossible since g€ M. Therefore ¢ i.e. M and MCp"*(M), 
aeG 
Consequently M =p (M7) and therefore M is a normal subgroup of G. 
q.e. d. 


THEorEM 5. Let G be a Frobenius algebraic group and M the regular 
subgroup of G. Then G=H-M, M, ts nilpotent and either M=M, and 
H is a p-group or M, is unipotent and the order of H ts prime to p. In any 
case, M, and M/M, are nilpotent and so M 1s solvable. 


Proof. By Theorem 3, G, is nilpotent and so by Lemma 3, if p is a 
rational representation of G with kernel G, then p(G@) is a finite Frobenius 
group (with regular subgroup p(//)). In the finite case then we have easily 
p(G) =p(H)-p(M). By the corollary to Proposition 1, G)CM and it 
follows that G—=HM; since HV M={e}, G=H-M. Also since 
M,—=G, and the next assertion follows from Theorem 3. By Thompson’s 
theorem [4] //M, is nilpotent. q.e. d. 


7. An application. As a consequence of his theorem Thompson was 
able to prove a theorem concerned with maximal subgroups of a finite group 
[4]. The following is an extension to algebraic groups. 


THEOREM 6. Let G be an algebraic group and suppose that one of the 
maximal subgroups M of G is nilpotent. Then G, is solvable. If [M: My| 
is odd, then G is solvable. 


Proof. If M is nilpotent so is M (smallest algebraic group containing 
M) and since M is maximal either M or M —G; therefore we may 
assume .V/ to be an algebraic subgroup of G. Since MCNg(M,) (normalizer 
in G of M,) either M, is normal in G or M—WNg(M,). Suppose first 
M=Ne(M,); then MDNeg,(Mo) =MN hence M, is a nilpotent 
subgroup of G, equal to its connected normalizer in G). Therefore M, is a 
Cartan subgroup of G, and is maximal nilpotent; since MA G, is nilpotent 
either MM Go = G, in which case G, is nilpotent, or else, as we now assume, 
(centralizer in of where T is a maximal torus 
of Go. Note and if Ne(T) =G, Ne, (T) =Ne(T) 1G =% 
hence 7 is normal in G, and G, is nilpotent; we assume then J/ = N¢(7'). 
hence Mp = MN G,=Neg,(T) = Og,(T) so the Weyl group of G, is reduced 
to {e}, hence G, is solvable. 

Now suppose M, normal in G; by considering a rational representation 
of G with kernel M, we are reduced to the case M finite. We proceed by 
induction on the order m of. M, the case m —1 being trivial. In particular. 
the induction hypothesis allows us to assume if N+ {e} is a normal sub- 


| 
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eroup of M then Ng(N)=—M. Since MNG.—N is normal in JM, if 
VA we have Ng(N)=M and Ne (N)=MNG—=N. Since N is 
nilpotent its center Z(N) A {e}; if ce Z(N), tHe, Ce(x)C M hence 
(,(2) =N. If se is a semisimple element of Z(N) then se€T, T a 
maxmal torus of @G) and 7CCq,(s) =N which is finite; therefore T = {e} 
and G, is nilpotent. If we is a unipotent element of Z(N) then w€ By, 
B a Borel subgroup of G, and Z(By)CCe,(u) =N; therefore By= {e} 
(since it is a connected nilpotent group and if dim B, > 0, dim Z(B,) > 0) 
and B, hence Go, is a torus. 

Finally then we may assume MM G,=N = {e}. Let H=Z(M); since 
WV is nilpotent HA{e}. Let G’=G,H and suppose x€G’ such that 
OH A say Go, h€ H and there exists h’€ H, h’ e, 
such that H. Then Gon H = {e}, hence h’*gh’g* =e, 
ie. gh’g?=Wh’ and g€ Cg(h’) CM. But G, and M = {e}, so that 
g=eand«—h¢€H. Therefore G’ is a Frobenius group and by Theorem 3, 
= G, is nilpotent. 

Now let p be a rational representation of G with kernel Go, p(G@) =G, 
p(M) =. Then G is a finite group, M a maximal subgroup of @ which is 
nilpotent, and the order of M divides [M: M,]. Therefore if [M: Mo] is 
odd so is the order of and it follows from [4] that G is solvable. Then G, 
being an extension of a solvable group by a solvable group, is solvable. 

q.e. d. 


Actually the structure of G—=G/G, can be completely determined also 
in the case [M: M,| even because Thompson has determined the structure 
of all finite groups having a maximal subgroup which is nilpotent [5]. 


CORNELL UNIVERSITY. 
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A FAMILY OF SIMPLE GROUPS ASSOCIATED WITH THE 
SIMPLE LIE ALGEBRA OF TYPE (G.).* 


By Rimuak 


Introduction. In this paper we construct a family of simple groups. 
The family contains finite as well as infinite groups. The order of the finite 
groups in the family are 


where g = 3?"*1, n=1,2,3,---. 
By using the fact that the above orders are divisible by 8 but not by 16, one 
can show easily that none of the simple groups listed in [1] or [4] has any 
of the above orders.* 

The construction is carried out by applying to the Chevalley groups of 
type (G.) a method which emerges naturally when one interprets Suzuki’s 
[5| construction of his simple groups from a Lie theoretical point of view. 
The Lie theoretical interpretation of Suzuki groups is as follows. Let g be 
the simple Lie algebra of type (B.) over the complex number field, and 

={+ta,+b,+(a+ )),+(2a+ 6)} the set of roots of g. For an arbi- 
trary field K, define the algebra gx over K and the automorphisms 7,(1), 
where r€ 3, ¢€ K, of gx as in [2], and let G be the group generated by 
all the z,(¢). Now assume that K is a field of characteristic 2 which admits 
an automorphism ¢— #9 such that 2671. By using a representation of ( 
(given, for example, in [3]) one can show easily that the matrix S(t, w) 
given by Suzuki in [5] corresponds to «(t)8(w), where 

a(t) = ay (t) (t?*) ; = Lary (t) 

The above expressions lead one to look for an automorphism o of G such that 
the elements a(¢)B(u) are exactly those elements in UW (the subgroup of & 


generated by all z,(¢) with > 0) which are left invariant by o. It turns 
out that there exists indeed such an automorphism of G. It is given by 


* Received September 19, 1960; revised February 21, 1961. 

* This work was done while the author held a Research Associateship, 1959-1960, 
of the Office of Naval Research, U. S. Navy. 

The author wishes to express gratitude to J. Tits for his persual of the manuscript 
and many valuable suggestions. 

? See Theorems 8.3 and 8. 5. 
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Denoting by B the group generated by all 2,(t) with r < 0, let 1 (resp. B*) 
be the group of elements in U (resp. 8) left invariant by o. Then the 
croup G' generated by U* and ¥? is simple when K has more thai 2 elements, 
and yields Suzuki groups when K is finite. 

The Chevalley group G of type (G.) happens to have a similar auto- 
morphism when X is a field of characteristic 3 which admits an automorphism 
tt? such that 36°?=1. Hence we can define the group G* in exactly the 
same way as above. As one of the main purposes of this paper we shall show 
that the group G* is simple when K has more that 3 elements. 

In the case of Suzuki groups, G* turns out to be the group of all elements 
in G@ fixed by o. For the groups of type (G2) the same holds true when the 
field K is finite, but it is unknown whether or not one can, in general, dispense 
with the finiteness condition. Some efforts in this direction may be found 
in Section 8. 

In the last section, we show that any automorphism of the finite simple 
group G of type (G.) can be written as the product of an inner auto- 
morphism and an automorphism induced by an automorphism of the ground 
field. The author has been unable to extend this result to the case of infinite 


groups.® 


1. Chevalley groups. Although we shall use later only Chevalley groups 
of type (G2), we treat here the Chevalley groups in general, since it does 
not make the exposition more complicated. 

Let = be a simple root system. We fix an order in & so that we can 
talk about positive, negative, and fundamental roots. Let P (denoted in [2] 
by P,) be the additive group generated by %. For r, s in & define the integer 
s(r) by s(r) = p—gq, where q (resp. — p) is the maximum (resp. minimum) 
integer 7 such that s+ir is a root. Then for each r€ 3, the map s—>s(r) 
can be extended to a linear map on P?. The symmetry w, with respect to 
r€ X is the permutation of & defined by w,(s) =s—s(r)r. The Weyl group 
W of 3 is the group generated by all the w,, r€ &. 

Now we fix an arbitrary field K, and denote by K* the multiplicative 
group of K. For r€& and z€ K*, define the homomorphism x,,,;: P— K* 
by X,2(u) 2". Denote by XY the group generated by all the x,,-, where 
r€3,z€K*. It can be shown that w(s)(w(r)) =s(r) for any r,s€ = and 
w€W. From this it follows that xow€ Y for any x€ X and we W. It is 
shown in [2] that, for any pair of § and K, there exists a group @ (denoted 


*J. Tits remarks that the finiteness condition seems to be dispensed with if one 
uses his geometric interpretation of the group G* given in [7]. 
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by G’ in [2]), which we shall call the Chevalley group of type & over K, 
and elements 2,(t), where r€ 3, ¢€ K, in G such that the conditions (1. 1)- 
(1.8), below, are satisfied : 


(1.1) Gis generated by the elements 2,(¢), where r€ 3, (€ K. 
(1.2) For any two positive roots r and s in 3, 
we have 


where the product is taken over all pairs (i,j) of integers such that ir + js 
is a root, the pairs being arranged such that the roots ir+ js form an 
increasing sequence, and where C;;.,,, are certain integral constants depending 
only on = (not on K). 


(1.3) For each r€ 3, there exists a homomorphism ¢: SL(2;Kq>G 
such that 


1 0 1 ¢ 
r_,(t), pr ¢ a(t), (t€ K) 
(1.4) There exists a homomorphism h: X > G@ such that 


z 0 
h (Xr,2) = (; 
for all r€ & and z€ K*. 


(1.5) For any x¢€ X and 7r€ we have 


h(x)2,(t)h (x) = 2,(x(r)e), We 
(1.6) For 
0 1 
we have 
= (8) (nr,st), (sex, K), 


where 7,,, are certain integral constants + 1 depending only on = (not on &). 
(1.7%) For any -,7m€% and any x€ X, we have 


oh fj (x’) 


where 


= Wr, 


—- 


K, 


js 
an 


hg 


A FAMILY OF SIMPLE GROUPS. 


and where x’€ X is defined by x’(s) =X(w1(s)) with 
W = * 


(1.8) +, and X satisfy ys —x(s) for all s€ and if 


m 

then 

Wy,’ == h(X). 
Here 7, are integers +1 defined from 7,,: * -,1m by 

where 


Note that the above properties (1.2)-(1.8) are all expressed in the form 
of “relations” of the generators z,(¢). Therefore, by using the constants 
(ij. and y,,s Which arise from a Chevalley group, we can consider a group 
( abstractly defined by the generators and the relations (1.2)-(1.8). In 
order to distinguish this group G from the underlying Chevalley group which 
gives rise to the constants C;;.,,, and y,,s, we shall denote the latter by G, and 
let c—> # be the natural homomorphism of G onto G. We shall show, in the 
rest of this section, that x—Z is an isomorphism. First we shall derive 
some consequences of (1.1)-(1.8). 


(1.9) If r——a, where a is a fundamental root, and if s is a positive 
root, then we have an identity of the form given in (1.2). 


Proof. Since a is fundamental, wz maps every positive root a upon 
a positive root. Applying (1.2) to a and w,(s), transform the resulting 
identity by ,, and rearrange the factors on the right such that the roots 
form an increasing sequence. Then by (1.6) we have the resired result. 


(1.10) For any r€ 3, we have o_o, —1. 


Proof. Since 


“DE 2) 
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we have 


IE 


(1.11) If satiafy 
Wr, Wr, =1, 


then there exists x€ X such that x(s) 7, for all s€ %, where 7, are the 
integers defined in (1.8) from 


Proof. Let 


Then by (1.6) and the definition of »; we have 
(t) wo = £5 (net) 
for all ¢¢ K. Applying the homomorphism x— <Z to the above, 
(t) = (net). 


On the other hand, it is known ([2], Lemma 3, p. 37; Lemma 4, p. 49) 
that (*) implies that o—h(x) for some x€ X. Then 

Hence we have x(s) =n, for all s€ &. 

From (1.8) and (1.11) we obtain 

(1.12) If7,- - -,1m€ satisfy the condition (*) in (1.11), then for 
some x€ X we have 

Wr," Or, == h(x). 

Denote by U (resp. ¥) the subgroup of G@ generated by the 2,(¢) with 
r>0O (resp. r<0), and by § the group generated by all the h(x), x€_V. 
For each W denote by the group generated by all the r,(t), K, 


such that r > 0 and w(r) <0. Also for each w€ W, choose once and for all 
roots 7;,° *,?m€3 such that 


and set 


( w) * 


Then we have the following fundamental proposition : 


W= Wr," Wrm 


A FAMILY OF SIMPLE GROUPS. 437 


(1.13) Every element in G is written uniquely in the form uho(w)w’, 
where w€ hE S, we W, and w’€ Up. 


First we shall prove the uniqueness part of (1.13). Suppose whw(w)u’ 

= uhywo(w,)u,’ with u,€ U, € wie W, and u,’€ Up, Then tha (w) 

= Then by [2], Theorem 2, we have #=%, h—h,, 

if =%,’. By (1.2) we can write wu and wu, in the form [[2z,(t,), where r runs 


over all positive roots arranged as an increasing sequence and where /,€ K. 
Then by [2], Lemma 6, p. 39, 7%, implies that ¢, for u and wu, coincide 
for all Hence Similarly, u’—w,’. Since w—w,, we have 
h—=h’ also. Thus the uniqueness part of (1.13) is proved. 

Before continuing the proof of (1.13) we shall make a definition. For 
any element » of the form 


(1.14) = Or, "Orn, 


define by 


It can be easily seen from (1.6) and the uniqueness part of (1.13), above, 
that £(w) is independent of the mode of representing o. 

Now, a process of putting the given element x in G in the form of (1.13) 
is given in [2], p. 38-40. The properties (1.1)-(1.9) are sufficient to carry 
out the process. Since this process is needed later, we shall describe it in 
detail for the convenience of the reader. It consists of the three steps (1.15)- 
(1.17), below. 


(1.15) Step 1. We express the given element x as 222: + ‘Xp, where 
each factor 2; is either of the form 2.,(¢) or of the form o., with a funda- 
mental root a. 


This step may be divided into the following 2 parts: 


(i) Write the given element x in the form [[2z,(t,). 


(ii) For each non-fundamental 7r€ % appearing in (i), above, choose 
w€ W and a fundamental root a such that r= w/(a) ; find fundamental roots 
* such that 

W == * 


and replace a,(¢) in (i) by 


— 
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(1.16) Step 2. We express the given element z in the form 
(*) t= uhow’ 


where u, uw’ € Ul, hE § and o is an element of the form given in (1.14). 
This step is carried out as follows. Let x=—2,-- -@ as in (1.15), 
and set 


Yo = 1, Y1 = Yo = * Yn = 


Then y, is trivially in the form (*). Suppose y;—=whow’ is in the form (*), 
If Yiss = a(t) y; with a fundamental root a, then y;,; is also in the form (*), 
Tf Yisr = OF if yi, then write uw appearing in y; in the form 
U=U,%,(t,), where u, is a product of some 2,(t’) with r>0, ra. Then 
by [2], Lemma 8, p. 40, yi: can be written in the form wu hvou’, where 
VE da(SL(2;K)). Set Two possibilities arise: w*(a) < 0. 


Case 1. wi(a)>0. By [2], p. 34, write v as v=a,(t’)h or 
VU where h€ Then it can be seen easily that y;,, can 
be put in the form (*). 


Case 2. wt (a) Use == Wqw instead of wo. Set 1). 


Then w,(a) <0. Hence we are in case 1. 
Thus, proceeding inductively, we can put y¥,—w in the form (*). 


(1.17) Step 3. We finally express the given element 2 in the form 
specified in (1.18), starting from the expression x = uhww’ obtained in (1.16). 
By making repeated use of (1.2), if necessary, we express wu’ € U1 as wu’ = Uy ts, 
where uw, is a product of 2,(¢) such that r>0, w(r) >0, where w—€(w), 
while u.€ l,. Then by (1.5), (1.6), and (1.7), we can write += ushou, 
with u;€ Also, by (1.12), o=—hw(w) with h,€ Thus (1.13) is 
proved. 

Now let x be any element in the kernel of the homomorphism «- <, 
and write z= uhw(w)w’ as in (1.13). Then arguing as in the proof of the 
uniqueness part of (1.13), one can show easily that ¢—1 implies e=—1. 
Hence the map x @ is an isomorphism of @ onto G. Thus we have proved 


THEOREM 1.18. For any simple root system & and any field K, the 
Chevalley group G of type & over K 1s characterized by the properties (1.1)- 
(1.8).* 

The constants Cj;;,,, and y,,, are not uniquely determined by &. If, how- 


‘For another characterization of Chevalley groups by generators and defining rela- 
tions, see a forthcoming book by J. Tits. 
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ever, we have, in the simple Lie algebra of type % over the complex number 
field, a set {X,|r€%} of root vectors which satisfies the conditions in 
Theorem 1 of [2], then we can obtain a set of the above constants by setting 
r,(t) =exp(tadX,) and by the relations (1.2) and (1.6). The constants 
ys can actually be defined by ([2], p. 36) 


( 19) 5 nr,sX w,(s) 
We have 


(1. 20) 1, Yr,-s = = Yr,we(s)> 


the first of which follows from (1.19) and the relation , = 2,(1)z_,(— 1)2,(1), 
the second from [2], p. 36, and the third from (1.9). Also from (1.9) 
we have 


(1.21) If r,s€% are distinct, and if none of r+ s, r—s are in &, 
then = 1. 

For r,s€ such that r+s€3%, define the number by 
=N,,Xy,;. Then from (1.19) we have 


(1.22) Ifr,s€ and if +r, +s, + are the only linear com- 
hinations of 7, s which are in &, then = 


(1.23) For any r,s,w€ such that s+ 3, we have 


2. The root system of type (G.). The rcot system & of type (G.) 
can be given as 


{+ & &—§| t, = 1,2,351 9}, 


where €, + € + &,—0. The Weyl group W of & is of order 12, and consists 
of elements w such that w(&) = += 1,2,3, and e—=+1, and where 
z isa permutation of 1,2,3. We shall write w= (e,7). Then w,= (1, (17) ) 
for r= &—&;, where (ij) denotes the transposition of 1 and j, and w, 
= (—1, (77”)) for r—=-+ &, where 1, 7’, i” is a permutation of 1, 2, 3. 
The relationship between the roots and the action of W on & can be 
seen easily in Fig. 1, where & is represented as a system of vectors on a 
Euclidean plane. The element w,€ W is nothing but the symmetry with 
respect to the line which passes through the origin and which is perpendicular 
to the vector representing the root r€ 3. The function A(7) on & defined by 


A(+ 


i) =1, A(&— =3 


€ 
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for all 1,7 =1,2,3; 1-47, has a clear meaning in Fig. 1. 


FIGURE 1. 


Since w,(s) =s—s(r)r, one can see easily that 
(2.1) (r,s) = 4A(s)r(s) 
coincides with the Euclidean inner product of the vectors 7,s€ %. 


Denote by P, as in Section 1, the additive group generated by 3, and 
define a homomorphism y: P— P by 


2.2) =&—&:, W(é.) =&—&. 
Then one can see easily that 
(2.3) =A(r)F 


holds for all r€ 3, where r—># is a permutation of order 2 of = such that 
(2. 4) —r=—f, =3 


for all r€ 3. From the definition of y, it is clear that y— V3 ya, where ys 
is the symmetry with respect to the line 8 bisecting the angle between ¢, —&; 
and —é, in Fig. 1.5 Hence (¥(r),w(s)) =3(1,8), which yields, by (2.1), 
(2.3), and (2.4), 

(2.5) A(r)F (5) =A(s)r(s) 


for all r, s in 


5 The idea of considering the line 6 in connectior with the map y was suggested to 
the author by J. Tits. 
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Applying y to w,(s) =s—s(r)r, we get 
y(wr(s)) 
by (2.5). Since A(w,(s)) =A(s), we have from the above 
(2.6) w,(8) = wr(5) 


for all 7,s€ &. For each w€ W, define a permutation @ of & by 


(2.7) (5) =w(s). 
Then from (2.6) we have immediately 


(2.8) The map w—@ is an automorphism of the Weyl group W such 
that wy for all r€ 

We shall consider elements invariant under the automorphism w— w. 
Suppose w= (e,7) satisfies ®—w. Then from (2.7) we obtain 


from which one can deduce easily that z is the identity permutation. Hence 


(2.9) If then or where wy is defined by 
=—r for all r€ &. 


3. Chevalley groups of type (G.). We shall determine a set of the 
constants C;j;,,,; and y,,; for the root system = of type (G.). The simple Lie 
algebra gq of type (G.) has a basis 


(3. ] ) X ij; Aw X ois Hi, 


> 


where 1, = 1, 2,3; 17, and where 


(3. 2) H,.+ H.; + Hs, =0. 

The multiplication table is 
= Xu, [XioX oj] = 38Xij, 
= jo] = Xio, 

(3.3) [X jo | 2 (17k) X ox, oj; ] =— 2 (17k) 
[XyX = Hiy, [XoXo] = + Hin, 
[XinHi;| Xio, [X | X ois 
2N [ XijH jx [Xi xi] = X ij, 


Where 7, j, & run over distinct values of 1, 2, 3, and where (ijk) =1 or —1 
according as 7, j7, k is an even or odd permutation of 1, 2, 3. All the other 
products of basis element which cannot be computed trivially from (3.2) 


and (3.3) are zero. 


44] 
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The elements H;; span a Cartan subalgebra h of g, and Xi, Xio, Xo; are 
root vectors with respect to h, corresponding respectively to the roots é;— é,, 
&, —é& given in Section 2. These root vectors satisfy the conditions in 
Theorem 1 of [2], and hence the table (3.3) can be used to compute a set 
of the constants and 

The constants 7,,, can be computed easily by (1.19)-(1.23), and by the 
fact that any even permutation of 1, 7, k leaves the table (3.3) invariant. 
The result of the computation is give in table (3.4). 


(3.4) The table of 7,5 


—1 —1 1 1 1 

1 —1 —l 1 —1 1 

Es 1 1 1 —1 

| —1 1 1 —1 

1 1 —-1 1 

—- 1 1 1 —l1 


All the other value of 7; which are not found in (3.4) can readily be 
supplied by (1.20). 

As an immediate application of (3.4) and (1.20), we can verify the 
equality 
(3.5) = 


for all r,s€ 3%, where r—F is the map defined in Section 2. Also, using 
(3.4) and (1.20) we can show that 


Wo = WE, 
satisfies 
(3. 6) (t) wo = (== t) 
for all € and ¢t€ K, and hence 
(3.7) wo”? = 1. 


Similarly, using (3.4) and the symmetry of the table (3.3), and noting that 
there can be only one element w)€ G satisfying (3.6), one can show that 


(3. 8) Wo == WE = WE;-E, WE, 


W_£,WE;-£; —= WE,-E,0-£, 


holds for any even permutation i, 7, k of 1, 2, 3. 
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Now, in order to consider the constants Ci;;r,s, we shall fix an order in 
Set a—&, Then all the roots are +a, +b, + (a+ 5), 
+ (2a+b), + (8a+b), + (3a+4+ 2b). We can regard a,b,a+6,:- to 
be positive. Then the fundamental roots are a and 6. For the map r>7 
defined in Section 2, we have 


(3. 9) a—b, (a+b) =3a+6, (2a+b0) —38a-+ 20. 


Now, the constants C1;:-,, and Ci1;,, can be computed easily from (3.3) 
using the formulas given in [2], p. 36, but Cs,2;2,, must be computed directly 
from (3.3). We shall express the results of our computation in the following 


form: 
(ra(t), Carv(U)) = tu) 
(3. 10) (ra(t), Loarp(U)) = Lsa+p(3tu), 
(to(t), == 


where (2,y) =a y"a2y. For all the other pairs of positive roots r, s, we 
have (2,(¢),2.(u)) = 1. 


4. An automorphism of G. In this section we assume that K is a 
perfect field of characteristic 3, and that ¢—> ¢ is an automorphism of K. 

Let = be the root system of type (G.). Using the notation introduced 
in Section 2, define x for each XE X, by 


(4.1) X(7) (7) = 
Then one can show easily by (4.1) and (2.5) that 
(4.2) 
for all r€ & and z€ K*. 
Let G be the Chevalley group of type (G.) over K. Set 
(4.3) = 
Define the homomorphism ¢,: SI(2;K)—G by 


Define the homomorphism h: X > G@ by 


(4.5) h(x) =h(X). 


| 
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We shall show that we can replace z(t), $,, h in (1.1)-(1.8) by Z,(t), 
¢r, h, respectively, without losing the validity of the statements. Denote by 
(1.n’) the statement obtained from (1.n) by the replacemeut. Then (1.1’) 
is clear. (1.3’) follows easily from (4.3), (4.4) and (2.4). (1.4’) follows 
from (1.4), (4.2), and (4.4). (1. 5’) follows from (1.5), (4.1), and (4.3). 


As for (1. 6’), w, is replaced by w;, since 


Then (1.6’) follows from (1.6), (4.3), (3.5), (2.6), and the fact that 
A(w(r)) =A(r) for all we W and re. As for (1.7’), w is, by (4.6), 
replaced by 


Hence, by (1.7) and (2.8), we have oh(x)o+—h(x,), where x, is defined 
by X1(s) =X(m@1(s)). Then, by (4.1), 


X1(s) ) 
= X(w-t (5) ) ¥’(s), 


where denotes for r€ Therefore, x: =x’. Now, (1.7) follows from 
(4.5). Also it can be seen that (1.8’) follows from (1.8), (2.8), (3.5), 
and (4.1). It remains to prove (1.2’). Since K is of characteristic 3, 
(3.10) now becomes 


(4. 7) (a(t), = Toasn(tw), 
Tsar0(U)) = Tsar20(tu), 


and all the other commutators of the form (z,(t¢),7,(u)), where r > 0, s > 0, 
are 1. Since the set of pairs {a,b}, {a,a+ 0}, {b,3a-+ 6} is invariant under 
the map {r,s} — {7, 5}, it is clear, in view of (4.3), that (z,(t),a,(u)) =1, 
r>0, s>0, implies (%,(t),Z,(w)) —1. Since 


A(a) =A(a+ —=A(2a+ 6) —1, 
A(b) =A(8a+ =A(Ba+ 2b) =3, 
we have by (3.9) 
(Ta(t), Ty(w)) = = 


= t?u)*) rar» 
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Thus it is shown that the first equation of (4.7) remains valid under the 
replacement 2,(¢)—>,(t). Similarly the rest of (1.2’) can be proved. 
Thus we have proved (1.1’)-(1.8’). Then from Theorem 1.18 and the 
simplicity of G, we obtain 


THEOREM 4.8. If K is a perfect field of characteristic 3, and if t— t? 
is an automorphism of K, then the Chevalley group of type (G2) over K 
admits an automorphism x— 2x such that 


( t) ( 


forall r€ X and t€ K. The automorphism o maps U, §, B onto themselves, 
respectively, and 
h(x)? =h(x), Wr? = wF 


for all xE X and r€ &, where x is defined by (4.1).° 


5. Invariant elements. From now on we assume that XK is a perfect 
field of characteristic 3, and that the automorphism 6 satisfies 


(5.1) 36? = 1, 


the right hand side of (5.1) being the identity automorphism.‘ 
Before considering the group G, we shall derive a few properties of the 
field K from (5.1). Since 


(280+1) 30-1 (230-2) 3642 


we have 


(5.2) Every element in K*? is of the form 2°71 with z¢€ K*. 

(5.3) Every element in K* is of the form z**? and also of the form 2°. 

(5.4)® —1 has no square root in K. 

Proof. Suppose Then (2’)?—=—1, and hence 
(296)? — (— = (= 2)?’ =—-z, 


but this is impossible. 


® After the completion of earlier versions of this paper and [9], the author learned 
from Tits that the existence of the automorphism o was already mentioned in [6], 
p. 282. See also R. Steinberg [8]. 

7It was discovered by Tits that K need not be assumed to be perfect if one con- 
siders an endomorphism 7, instead of 0, of K such that 7? = 3. For the details of this, 
see [7]. 

° This lemma was pointed out to the author by R. Steinberg and J. Tits, indepen- 
dently. The proof given here is due to Tits. 
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An element x in G will be said to be o-invariant if x7 =z. First we 
shall consider o-invariant elements in U. Suppose 


La(tio) Lp ( tor ) La+b ( ) Learb ( tor ) Lsa+b (ts2) 


is o-invariant. We have 


(t10°”) La ( tor") ( L3a+2b ( Lard (ts19) Lea+b 


By (4.7), the right hand side of the above can be changed into the form 
* Then comparing with z, we get 


By (5.1), the equations in the first row, above, are equivalent to the equa- 
tions in the second row. Hence if we set then 
r=a(t)B(u)y(v), where 


a(t) = (t) Lary (t%*) (471), 
B(t) = Tain 


Hence, by [2], Lemma 6, p. 39, we have 


(5.5) The group UU of o-invariant elements in U contains a(t), B(t), 
y(t) for allt¢€ K. Every element in 11* is written uniquely as a(t) B(w)y(v) 
with ¢, u, ve K. 


By using (4.7) one can show easily 


a(t)a(w) = a(t + w)B(— tu) y(— Pu + tu), 
B(t)a(w) =a(u)B(t)y(—tu), B(t)B(u) =B(t+u), 
y(t)a(w) =a(u)y(t), y(t)B(u) =B(u)y(t), 
y(t)y(u) =y(t+4). 


It can be shown easily from the above that the group 11' is generated by the 
elements a(t), K. 
Before considering o-invariant elements in %, we remark that the element 


(5.6) 


wo given in (3.8) can be written 
(5. 7) Oo = Ma+b%3a+b 


and hence is o-invariant by (3.9) and Theorem 4.8. Now define the elements 


a(t), BY(t), y/(t) by 
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a” (— t) = (t) wo 


Then from (5.5) and the o-invariance of w) we have 
(5.8) The group of o-invariant elements in contains «’(t), B(t), 
/(t) for all t€ K; every element in ¥% is written uniquely in the form 


a’(t)B’(u)y’(v) with t,u,ve K. 
Denote by G* the group generated by 11" and %1. Since 


—1 \O 1/\—1 1/\0 1 

and since any one of (t1), Varo(t2) commute with any one of 
(tg), we have from (5.7) 

(5.9) wy = B(1)8’(— 1) B(1) € G@. 

Now we shall consider o-invariant elements in §. We have h(x)? =h(x). 
Since a and b generate P, x is determined by x(a) and x(b). From (4.1) 
we get 
(5. 10) X(a) =Xx(b)*, x(b) =x(a)™. 

Hence we have 
(5.11) h(x) is o-invariant if and only if x(a) =x(b)?. 
By (1.5) and the definition of the elements a(t),- - -, we have 


(5.12) If h(x) is o-invariant, then 


h(x) a’ (t)h(x)-? = @’(x(b) +4), 
h(x) B(t)h(x)* = B(x(b)°*t), h(x) B’(t)h(x)* = B’(x(b) 


Denote by §* the group of all c-invariant elements in §, and by $° 
the subgroup of §* consisting of all o-invariant h(x) such that x(b) = 2°? 
with some z€ K*. Note that some elements, for instance —1, in K cannot 


be written in the form 29%, Since 


(0s) = (eta i) 


and since any one of 2q.»(t1), @(aiv) (t2) commute with any one of %sa,9(ts), 


T_(sa:v) (ts), we have 
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0 z 0 
= Pa+b (| Psa+b a = h (Xa+0,20) 
Set 
X= 
Then it follows from the above that h(x) € Gt. We have 


Since z is an arbitrary element in K*, we have 
(5.13) 
Now we shall determine all o-invariant elements in G by (1.13). In 
view of (3.6), we can assume, for the w, given in (2.9), that 


(5. 14) w(Wo) = Wo. 


By (1.13) we can write any element x in in the form = uh(x)o(w)u, 
where w€ U and wu, is a product of elements of the form z,(t) with r> 0, 
w(r) <0. We have 27 If 


o(w) * 
then by Theorem 4.8 we have 

w(w)? 
and hence by (1.12) and (2.7) we can write w(w)?=h(X:)o(@). Then 
(5.15) = Uh (XX;)0(W) 
Here wu,’ is a product of elements of the form 2,(t), r>0, w(r) <0. We 
have #>0, and #(#) =w(r) <0 by (2.7). Hence the expression for 2° 


given in (5.15) is exactly in the form specified in (1.13). Assume that x 
is o-invariant. Then «2%, and the uniqueness in (1.13) implies 


Xx=XX1, 


Then by (2.9) we have w=1 or w—w,. Then, since we can assume 
w(1) =1, we have from (5.14) o(w)*=w(w), and hence h(x,) =1, X,=1. 
Hence X =X and h(x) is o-invariant. Thus we have proved 


(5.16) Any o-invariant element x in G is written uniquely in the 
form x—wuh(X)ou,, where: x =X; o—1 or w3 if o—1, 
but uw, is an arbitrary element in 11° if » ~wp. 


| 
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The form of x given in (5.16), above, will be called the canonical form 
of 

6. Some identities. We shall compute the canonical forms for «’(t) 
and y’(t), by following, more or less, the steps (1.15)-(1.17). The results 
are crucial, as will be seen later, in the discussion of the structure of G"'. 
Note that our assumption on the field K is the same as in the preceding 
section. 

(6.1) f t€ K*, then 

where x is defined by x(a) =—t-*8, x(b) =— 

Proof. We have 


By applying dea.» to 


we obtain 
L_(2a+b) — (t- 2) h (x1) , 
where 
X1 = Xea+b,-t- 20+). 
By transforming the second equation of (4.7) by wa», we obtain 
Hence we have 
(Xs) 

where 


X2 = 


since, by transforming the second equation of (4.7) by wa, we obtain 


Now, by applying ¢a to 


| 
| 
| 
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we obtain 
(t9) rq (t-*) = h(X3) 
where 


X3 = Xa,-1° 
Hence we have from the above 


SINCE by (3.8). Since 


La ( t?) h (X2) ly, ) Va ( h(x, © Woas) 
h (Xe) “*h (Xi © ( Lp ( ) (— {-(6+1) ) ( {-(26+1) ) 


we have 


a(t) = earn 2) (— t*) (— h(x) 
X = XsX2(X1 


From this one can derive (6.1) easily. 
(6.3) If t€ K*, then 
y(t) = y(t) a 7h (x) opa(— y (#7), 
where yx is defined by x(a) = t?-*, x(b) =. 


Proof. We have 
= (t9) (30420) (t). 
From (6.2) we have 
T_(204+2b) (t) = Tsa+2b ( (X; e100 


where 


X1 = Xsa+2b,-t71 
On the other hand, we have by (1.6) and (1.10) 


U_(9a+b) (77) = (t9) 


Hence we have 


( t) = ( T3a+b ( ) alt (X1) 30420 (t*) ° 
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By transforming the first equation in (4.7) by wa, we obtain an expression 
of the form given in (1.2) for (#-a(t),%sa:.(u)). Combining this with 
(4.7), we get 
Hence we have from the above 
U_(a+b) (t°) wah (X1) 
From (6.2) we have 


where 


Xo = Xa,-t!-20- 


Hence 


* V_(3a+d) (— (t°)h (Xi 


SINCE w_qw3q12p =o by (3.8). Hence we have easily 
=y(t*) a (x) 9% (— #9?) 
where 


X Wsasen) 


from which it can be verified easily that x(a) = t?*, x(b) =t%4. Thus 
(6.3) is proved. 
(6.4) ° The element ho h(x), where x(a) =x(b) =—1., is contained 


in hy commutes with wo, and, for all K, 


hoa(t) hot =a(—t), hoy(t)hot =y(—?). 


* The idea of deriving the proposition (6.4) from the canonical form of «’(t) is due 
to Tits. The author originally proved (6.4) under the assumption that K is finite, 
using (6.3) and (8.6). 


| 
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Proof. The first part of (6.4) can be proved by setting {1 in (6.1), 
The second part is a special case of wohwot = h€ §, which follows from 
(1.7). The third part follows from (5.12). 


7. The simplicity of G'. We shall prove the simplicity of G* under 
the assumption that the field K has more than 3 elements. If K has only 
3 elements, then G*, which is of order 2°-3*-%, does not seem to be simple, 
although the author has been unable to verify this; also it is unlikely that 
any new simple group appears as a composition factor of G*. In what 
follows, however, we shall arrange the proof of the simplicity of G* in such 
a way that we use the assumption on K only in the last stage. 

Let H ~{1} be a normal subgroup of G*. Denote, as in Section 5, 
by U*, $*, the groups of o-invariant elements in 1 and §, respectively. 


(7.1) H contains an element uh where wé€ he 


‘Proof. By (5.16) we can assume that H contains +=—wuhwo, where 
welt, hE Let ho € G be the element defined in (6.4). Then 
houhy*u*uhwy € H, and hence H. Since y€ we obtain 
(7.1) if yA 1. Hence we shall assume y1. Then from (6.4) and (5.5) 
we have u = with some ¢,€ K. Hence x= B(t))ho.€ H. Let h=h(x). 
Since, by (5.12), 

B (to) h(X) woy (x(B) B (to) y(t) h(X) oo, 
and since h(X)w8(to) € H, it folllows that H contains 7, = y(x(b)*%*t)y’(t) 
for all ¢€ K. By (6.1), 

where X; is defined by xX: (a) = x,(b) = Hence H contains 
Lo = a(— 136-2), (— 
It is clear, in view of (5.6) and (5.12), that 7.€ 16! and that 7.1 
if t40. Hence (7.1) is proved. 
(7.2) H contains an element ~1 in 1. 


Proof. By (7.1), H contains uk A1, we W, he Assume h = h(x) 
~1. Then by (5.11) x(b) ~1. For any ¢€ K, H contains the element 


uhy (t) h-*u-ty (t)-* = wy (x (b) u-ty (t) 
= y((x(b) —1)t) 
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which is 41 if since x(b)*%?—-1—0 would imply x(b) —1 because 
of (5.1). 

(7.3) H contains an element y(t), (0. 

Proof. By (7.2), H contains an element wu1 in WW. 

If w=a(t,)B(te)y(ts) with t,540, then for any ¢€ K, H contains, 


by (5.6), 
u*B(t)*uB(t) (ts + tt) = y(t). 


If u—B(t.)y(ts) with 40, then for any K, H contains 
u-ta(t)-*wa(t) = (te) y(ts— tte) = y(— tte). 
In any case, H contains an element y(t) with ¢0. 


(7.4) If H contains y(t), t 40, then H contains y(2*¢) for all 
2€ K*, and, in particular, y(¢*) € H. 

Proof. Let h(x) € $* be such that x(b) =251. Then x(a) =x(6)? 
= 21-9; x(3a+ 2b) = 291, By (5.13), H contains 

h(X)y(t)h(x)* = y(x(3a + 26) t) = y (2°) 
for any z€ K*. Now set Then Hence H contains 
y(t*). 

(7.5) If H contains y(t), t40, then H contains «(t®*)B(t'-**) and 
h(X)wo, Where x is defined by x(a) = t?-*9, x(b) = 14. 

Proof. Since G* contain w) by (5.9), y(t)€H implies that 
y'(t)€ H. Then from (6.3) and (7.4) we can conclude that H contains 
a (— 139-2) (¢98-2)-1h (x)wo, where x is defined by x(a) = #?-*, x(b) = 
Since 

(— u) B(— 113441) (439+2) 
hy (5.6), H contains 
a (— £99-2) (— £992) B(— = & (49?) y (7) A(X) a. 
Then by (7.4) H contains 2, = a(t®*?) B(#-*")h(xX)oo. Replacing ¢ by —t, 
we see that H also contains 2. = a(— B(t*-*") h(x) wo. Hence H contains 


Hence by (7.4) H contains a(¢8*?)@(#-**). Then H contains h(x), since 
*,€H. Thus (7.5) is proved. 
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(7.6) H contains y(t), and h(x) for all A*, 
where xX is defined by x(a) = x(b) = #?. 


Proof. By (%.3) and (7.5), H contains r= a(t)°*?)B(t'**) for some 
t,€ K*. Hence H contains 


= 


Since wu is an arbitrary element in K, it follows that H contains y(¢) for all 
t€ K. Now we can set ¢=1 in (7.5). Then h(x) =1, and hence w)€ //. 
The rest of (7.6) follows immediately from (7.5) and the fact that (°” 
can be an arbitrary element in K*. 


(7.7) H contains the element h, defined in (6.4). 
Proof. Since H contains w) by (7.6), it follows that H contains 
(—t)*woa(—t) oo? = 
which is, by (6.1), equal to 


where X is defined by x(a) =—t*’, x(b) =—t*. Hence by (7.6) J// 
contains 
a (— t)-*B (t-@)) a (t-*) ho. 


Since H contains a(—?)*8(¢°%1)-* and a(¢+)B(t-@*) by (7.6), we can 
conclude that H contains B(t°%")B(— for any t€ K*. Setting t= 1, 
we have A, € H. 


THEOREM 7.8. If K contains more than 3 elements, then the group (@ 
is simple. 


Proof. Let H A {1} be a normal subgroup of Gt. By (7.6), H contains 
h(x) where x is defined by x(a) =”, x(b) =#*. Hence by (5.12), H 
contains 


B(u)-*h (x) B(w)h (x)? B( —1)u) 


for every ¢40 and uw in K. By (5.1) ¢?—1=0 would imply ¢*=1. 
Since K has more than 5 elements, there exists t€ K* such that {*1. 
Therefore H contains B(t) for all ¢¢ K. Then from (7.6) it follows that // 
contains wo, a(t), B(t), and y(t) for allt¢ K. Hence This proves 
the simplicity of @. 
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8. Further results on G*. First we shall consider 2-subgroups of G’. 
(8.1) If ¢€ K*, then 


B’(t) (t+), 
where X is defined by x(a) = ¢*1, x(b) = 
Proof. Applying ¢, to (6.2), we obtain 
w_y(t) == 2, (t-*) R(Xr,-4-2) ot, (4-7). 


Since any one of (4) commute with any one of 
(WU), aNd SiNCe by (3.8), we have from the above. 


B’ (t) = (t9) (4) = B(E*) (E*), 
where 


X = 
It can be verified easily that x(a) = t*1, x(b) =?'*". Thus (8.1) is proved. 
(8.2) The element 2) £(1)o.8(—1) commutes with wp. 
Proof. From (8.1) we have 


B’(1) =B(1)ooB(1), B’(— 1) = B(—1) oo8(—1). 
Hence a) = 8’(1)B(1) = B(—1)f’(—1), from which it follows immediately 


that = Zo. 


THEOREM 8.3. The group G' contains an elementary abelian 2-sub- 
group of order 8. 


Proof. Let a) and ho be the elements defined in (8.2) and (6.4), 
respectively. Then, by (8.2) and (6.4), it is clear that the 3 elements 
Voy ho, ) Of order 2 commute with each other. By (5.16), ho and w, generate 
a subgroup of order 4 which does not contain vy. Hence 2, ho, and oy 
generate an elementary abelian 2-group of order 8. 

Now assume that K is a finite field of g—3?"*! elements, where 
u==0,1,2,- > -. Then the automorphism 6 of K defined by 1?=#”, where 
m= 3", satifies 3071. Hence the group G' can be defined. Since K* is a 
cyclic group of order g—1, the subgroup K** of square elements is of index 
2 in K*, By (5.4), —1¢K*. Hence K* is generated by K** and —1. 
Hence by (6.1), (6.4), and (5.16) we have 


(8.4) If K is finite, then G' is the group of all o-invariant elements 
in 


| 
| 
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By (5.16) and (8.4), above, the order of G* is 


P(q—1) + P(q—1) =F (I—1) (F +1). 
Since g—1=2(mod 8), g*-+1=4(mod 8), the above order is divisible by 
8 but not by 16. The 2-subgroup of order 8 given in (8.3) is then a Sylow 
subgroup. Hence 


TuHeoreM 8.5. If K is a finite field of g=3?" elements, where 
n=0,1,2,---, then the order of G is g?(q—1)(q°+1), and a Sylow 
2-subgroup of G* ts an elementary abelian 2-group of order 8. 

All the previously known finite simple groups whose orders contain the 
prime 2 with the exact exponent 3 are: L3(2), Le(q) with gq=+7 (mod 16), 
and the alternating group of degree 7 (cf. [1]). Among these only the first 
group, of order 168, has an elementary abelian Sylow 2-subgroup, and for 
the rest the Sylow 2-subgroups are dihedral. Hence Theorem 8.5, above, 
shows that the finite simple groups of the form G* are new. In passing, we 
note that all the known finite simple groups whose orders contain the prime 
2 with the exact exponent 2 are the groups Le(q) with g=+3 (mod8), 
q>3 (ef. [1]). The Sylow 2-subgroups of these groups are, of course, 
elementary abelian, since they cannot be cyclic. 

We shall devote the rest of this section to a study of the group 8 of all 
z€ K* such that x(b) =z for some h(x) €G'. If K is finite, then (8.4) 
shows that S=K*. In the general case, we already know that z*#1¢€ § for 
any z€ K* by (5.13), and that —1€ S by (6.4). Except for these, however, 
we know very little about S. We shall prove 


THEOREM 8.6. We have 1+ 2°€S for any z€ K. 
First we shall prove a few lemmas. 
(8.7) Let 
L3ar20 (t32) (ts: ) (ter )@a(tio) wo 
with € if 
t:9 1 — ts: F0, 1 + tote: — ts: AO, 
then 


L_(3a+b) (— 1) & = (Ui) Learn (Wer) 
Ly (Uor) La(U10) A(X) woz’, 


where x’ € U, where x is defined by 


X(@) = (1 + tote: — ts1)*(1— 
X(b) = (1+ trots: — ts1)* (1 — 
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and where 


= bra + bio — — tio *ts1)? 
= — toy — tio?) (1 
= — (tio — — 


Wor = — — + tor — bio 


+ (tox = (tio — te; — tio?) Sts, (1 — 


= — 3tg1? — forts1 — (tox — tio 


Proof. We merely follow the steps (1.15)-(1.17) mechanically to find 
the canonical form of the element (—1)% = wet_»(—1) wg We 


omit the details. 
(8.8) Let 


Y = Lza+2b Lsarb (Us1 ) Learb ) Lard (ti ) Lp ) La (U10) Wo 


with wj€ K. If 


Uor 1 — Uy, — U9 0, 1 + Uoitls2 — 


then we have 
(—1) y = y"h(X1) 


where 7’ and y” are in ll, and where x; is given by 


Xi (a) = (1 + Uits2 


U11°) (1 == 


(1 + Upitlg2 — U11°) (1 — — Uoi10)*. 


Proof. We make use of the automorphism o of G. Set 


Then by an easy computation we get 


8 


Hence 40, are clear. Also 


ts1 — biota: = — tho == (u; tse)? ~1. 


Hence we can apply (8.7) to x. Set 
(— 1) & = 2h (X) wor’, 


where 2’, 2” are in 1t and where x is given in (8.7). 
above, we get 


(—1)y = yh (X) 


Applying o to the 


45% 
W 
re 
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where y” = (2”)%, y’ = (2’)*. Hence we can set We shall compute 
the values of X,. By (8.7) we have 


X(a@) = (1 + (1 — 


Since x,(b) =x(a)** by (5.10), we have 
Xi (b) = (1 + Uortls2 — (1 — — *. 
Similarly we can compute X,(a). Thus (8.8) is proved. 
Proof of Theorem 8.6. Set «= y(— with A0, + 1. 
Then 
= (30+?) xy (t) wo 

Since ¢40, we can apply (8.7) to 2. We obtain 

Y = (— 1) = (Use) * Va(Ui0) 
where 2’ € Ul, and 

We wish to apply (8.8) to y. The condition on uj in (8.8) are easily found 
to be equivalent to 74 +1, and hence by (5.4), to ¢ A241. Since these 


are assumed, we can apply (8.8) to y, and obtain 


= (— 1A) y = yh (X1) woy’, 


where y’, y” are in (since and #’(—1) are a-invariant), and where /(x;), 
which is also o-invariant, is given by 


x, (b) (1 + + 


Since x is in G', it is clear from the above that h(xX,) is in G. By (6.1) 
and (6.4), K**C 8. Then the above implies that 1-+ § whenever 
tA0,+1. Write z= #?%!, Then and the conditions t40, + 1 are 
easily seen to be equivalent toz~0,+1. Ifz—0or +1, then 1+ 27 = +1, 
and hence by (6.4) 1+ 22€ 8. Thus (8.6) is proved. 


9. Automorphisms of G'. If ¢—?’ is an automorphism of the field K 
which commutes with 6, then, since G is generated by a(t),- - -,y’(¢), it 
can be seen easily that the map a(t)—>a(t’),-- -,y(t)—y'(#) can be 
extended to an automorphism p of G*. We shall call p the automorphism of 
G induced by the automorphism t-—?’ of K. If K is a finite field, then any 


te 
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automorphism of K commutes with 6 and hence induces an automorphism 


of G. We shall prove 


TuHeorEM 9.1. If K is a finite’ field possessing more than 3 elements, 
then every automorphism of G* can be written uniquely in the form pip», 
were p; is an automorphism of G induced by an automorphism of K and where 
p2 is an inner automorphism of G. 


Proof. Let «—> 2x? be an automorphism of G*. Since ll" is clearly a 
Sylow 3-subgroup of G*, and since any two Sylow 3-subgroups are conjugate, 
it follows that 1 and (11')? are conjugate; there exists an inner auto- 
morphism p’ such that pp’ maps UU onto itself. Hence we can assume from 
the beginning that 


(9.2) = 


By (5.6) it can be seen easily that the center of WU consists of the 
elements y(t), ¢€ K. Hence by (9.2) there exists an automorphism t— ?’ 
of the additive group of K such that y(t)*"=y(t’). Write c—1’0. Since 
K is assumed to be finite, there exists, by (8.4), h—=h(x)€ G such that 
x(b) =c***, Denote by p’ the inner automorphism defined by 2” =hzh". 
Then y(t)? = = y(x(b) = y(c%l’), and hence y(1)" = (1). 


Clearly p’ maps ll* onto itself. Hence we can assume from the beginning 
that p satisfies not only (9.2) but also 
(9.3) y(t)*=y("), 


where {> ? is an automorphism of the additive group of K such that 1’=—1. 


We shall show that the group U9? is the normalizer of 11 in Gt. It is 
clear that 11'§* normalizes 1. Suppose x€ G* normalizes WU’, and let 
t=uhwou, be the canonical form of z given in (5.16). Then w normalizes 
W. Hence ow, and we have ow=1. Thus 7€1U§'. Then by (9.2) 
we get 


(9.4) = WS, 


from which it follows that there exists a homomorphism h(x) > h(x’) of H? 
into itself such that 


(9.5) UW. 
By (9.3) and (9.5) we obtain 


x’ (x(b) 924)’, 


10 See footnote 3. 
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Setting {= 1 in the above, and using 1’—1 we obtain 
(9.6) X’(b) (x(b) 
Then from the above we get 

(x(b) 99?) (x(b) 98)’. 


Since, by (8.4) and (5.3), every element in K* can be written in the form 
x(b) 9%? with h(x) € G+, it follows that w’t’ = (ut)’ for any u, t in K. Since 
t—? is an automorphism of the additive group of K, we have thus shown 
that the map ¢—>? is an automorphism of K. Denote by p’ the automorphism 
of G* induced by the inverse of the automorphism t— ¢’. Then y(t’)? = 
=y(t) for all ¢¢ K. Also, it is clear that p’ maps 11' into itself. Hence 
we can assume, for the proof of Thseorem 9.1, that #’—¢# for all t€ K. 
Then 

(9.7) y(t)? =y(t) 

for all ¢€ K, and, by (9.6), x’=x. Hence, by (9.5), 

(9.8) h(x)Ph(x)-2 € W, 


Now, by (5.16) one can show easily that woll*wo? NM $= {1}, which, 
combined with (9.4), shows that wo? must be of the form wo? = uhww’, 
where u, w’€ Ut, RE SH. Then wo? —1 implies =1, which in 
turn implies u’u—1. Denote by p’ the inner automorphism of G'. 
Then wo’? hwo, and we can replace p by pp’ in (9.2), (9.7), and (9.8) 
without losing the validity of the statements. Hence we can assume 


We shall show that p—1 follows from (9.2), (9.7), (9.8), and (9.9). 
This is clearly sufficient for the proof of Theorem 9.1. By (9.8) we 
can write h(xX)’=uxh(x) with u.€ it. Substituting this and (9.9) to 
wo *h(X)wo = h(x)? one can derive easily ux—1. Thus we have, for all 
h(x) € $', 

(9.10) h(x)? =h(x). 


By (9.2) we can write a(1)?’=«a(c)B(¢,)y(¢2) with c, c,, and c, in K. 
Then by (9.10), (5.12), and the fact that x(b) can be regarded as an 
arbitrary element in K*, we have 


(9.11) a(t)? a (ct) B(c,t99**) y 
for all t€ K. 
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Let 8' be the center of 1', and 8/3? the center of U?/3'. Then by 
(5.6) one can see easily that 8? consists of the elements B(t)y(u),¢,wé K. 
Hence B(t)° =B(t:)y(te). Let ho be the element defined in (6.4). Then 
by (9.10), ho? =o. Hence we have B(t)? = 8(t,)y(—t#.). Then t, 
t,—=0. Thus we have B(t)*’=£8(t,). Combining this and (9.11) with the 
second identity of (5.6) one can deduce easily cut, ut for all u, ¢ in K. 
Hence c0, and Thus we have proved that 
(9.12) B(t)? = B(c*t) 
holds for all ¢¢ K. Now combining (9.11) and (9.12) with the first identity 
of (5.6) one can deduce 

for all t,u€ K. Since 
(t + 36+2 + $39, + ty? 
we have — — ¢,) tu®? for all K. Now assume that K 
has more than 3 elements. Then we have c, — 0, c+—c*#t—c, —0. Hence 
—1, c=—1. Thus we have 


a(t)? —=a(t)y B(t)? = B(t) 


for all ¢€ K. Combining these again with the first identity of (5.6), we can 


deduce 


for all t;w€ K. Since 
(t +- wu) __ £30+2 + (tu uw?) + ( ¢? tw) 


we have, by setting w—1, c,(¢89—t¢)(t—1) =0 for all ¢€ K. It can be 
easily seen that ¢°?—-t=—0 implies that {0 or +1. Since K is assumed 
to have more than 3 elements, we conclude c,.—0. Thus we have proved, 
for all t€ K, 
(9.13) a(t)’—=a(t),  B(t)*—B(t). 

Now, by (8.1), (9.7), (9.9), (9.10), and (9.13), we have 


h (Xo) (— 1) (Xo)-* = B(1)h (Xo) oo8(1). 
Then by (5.12) we have 


B’ = B(1) B(1)*B"(1). 
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Then wolltwo? M = {1} implies A(X.) =1. Thus 
we have proved wo’ wo. Then by (9.7) and (9.13) we have p—1, since G 
is generated by the elements a(t), B(¢), y(t), and wo. Thus the proof of 
Theorem 9.1 is complete, except for the uniqueness part. 

To prove the uniqueness, let t—?’ be an automorphism of K, and 
suppose that the automorphism of G* induced by it is the inner automorphism 
«—axa*. Then clearly =U’, and, since the normalizer of on G* 
is 10'S, a must be of the form a—uh, where w€ U’, hE H*. On the other 
hand, since 1’ = 1, (— 1)’ = — 1, a must commute with wy) = B(1)8’(— 1)B(1). 
From this we can conclude easily u—=1, h =1, and that t— ? is the identity 
automorphism of K. Thus the uniqueness part is also proved. 
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ALGEBRAIC ISOMORPHISMS BETWEEN ORDERED BASES.* 


By J. R. RinGRosE. 


1. Introduction. Let H be a complex Hilbert space, and denote by 
(H,H) the class of all bounded linear operators acting in H. An operator 
algebra over H is a subset J of (H,H) such that, if A,Be J and A is a 
complex number, then A+ B,AB,AA€E JF. If F1, are operator algebras 
over Hilbert spaces H,, H, respectively, an algebraic isomorphism from J, 
onto J, is a one to one linear and multiplicative mapping ¢ from J, onto 
J». If there is a non-singular continuous linear transformation Q from H, 
onto H, such that 

¢(T) =Q°TQ (TEF:), 


we say that ¢ is a spatial isomorphism, and is implemented by Q. The main 
object of the present paper is to prove that, if J, and J. are ordered bases 
(in the sense of [3]), then every algebraic isomorphism from J, onto J: 
is spatial. 


2. Notation and definitions. Let J be an operator algebra over a 
Hilbert space H. We denote by J* the algebra {7*: T€ J}. By a hull of 
J we shall mean a closed subspace ZL of H which is invariant under each 
operator in J. A hull idempotent of JF is an idempotent in (H,H) whose 
range is a hull of J. A self-adjoint hull idempotent will be called a hull 
projection. <A full class of hull idempotents of FJ is a family ¥ of hull 
idempotents such that, given any hul! LZ of §, there is a member of J 
whose range is L. 

Following Kadison and Singer [3], we shall say that the operator algebra 


algebra J is a maximal triangular algebra if 
(i) is a maximal abelian sub-algebra of (/1,H) ; 
(11) amongst algebras satisfying (i), J is maximal. 


Under these conditions, @ is a commutative von Neumann algebra, called the 
diagonal of J. It is shown in [3] that the hulls of a maximal triangular 
algebra are linearly ordered (by inclusion), and that each hull projection lies 
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in the diagonal of the algebra. If the hull projections of a maximal triangular 
algebra J generate the diagonal (as a von Neumann algebra), we shall call 
J an ordered basis. The motivation for these definitions is indicated in [3]. 
We note that, owing to the need in this paper to consider non-orthogonal 
projections, our use of the term hull differs slightly from that of [3]. 

We now give an example which is of some importance in the theory. 
Let p be a non-negative measure defined on the Borel subsets of the closed 
interval [0,1], and denote by Hp the space L?({0,1]: p) of p-measurable 
functions whose squares are p-summable over [0,1]. Given a p-essentially 
bounded function f(t), let 1M, be the operator of multiplication by f. The set 


Ap = {M;: fe L([0,1]: p)} 


is a maximal abelian sub-algebra of (Hp, Hp), the multiplication algebra over 
H,. This algebra contains the orthogonal projection H(r) of multiplication 
by the characteristic function of +, whenever + is a Borel subset of [0,1]. 
We refer to E(r) as the spectral measure in Ap. We define =F ([0,A]), 
Ey.=E((0,A)) (OSAS1). Now let Fp be the class of all bounded 
linear operators acting in Hp, which leave invariant the range of each JL). 
Then Jp is an ordered basis over Hp, with diagonal @p and hull projections 
Ey, Ey. (OSA=1). For these facts we refer to [3]. The importance of 
this example lies in the following representation theorem, which is proved 
in [3] (see p. 248). 


THEOREM. Let J be an ordered basis over a separable Hilbert space H. 
Then 1t is possible to choose a non-negative measure p defined on the Borel 
subsets of the interval [0,1], and a unitary operator U from H onto Ip, 
in such a way that J 


As an immediate consequence of this theorem, we note that an ordered 
basis consists exactly of those operators which leave invariant each of its 
hulls. 

If J is an operator algebra over a Hilbert space H, we denote by J’ the 
commutant of J: that is, 


5’ = {S¢(H,H): ST=TS (TE 


The von Neumann algebras over H can be defined as those for which 


J = J*— (J’)’. 
3. Some preliminary lemmas. 


Lemma 1. The following properties of elements of an operator algebra 
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J over a Hilbert space H are conserved by any algebraic isomorphism between 
two operator algebras. 


(i) FE is idempotent. 
(ii) E’ are idempotents and E(H)C H'(#). 


(iii) are idempotents and E(H) (Hf). 
(iv) FE is a hull idempotent of J. 


Proof. In order to prove parts (i), (ii) and (iv) we shall state in each 
case an equivalent property which is obviously preserved by algebraic iso- 
morphism. Part (iii) is immediate from (ii). 


(i) E?=E. 
(ii) 
(iv) TE—ETE (T¢9). 


LemMaA 2. Let be maximal triangular algebras, and let be 
an algebraic isomorphism from J, onto Jz. Then if F (CI:) ts a full 
set of hull idempotents for the family = {o(#): EE F} ts a full 
set of hull idempotents for Fo. 


Proof. It is immediate from Lemma 1(iv) that ¢(#) is a family of 
hull idempotents for J... Suppose now that L is any hull of Jz, and let F 
be the corresponding hull projection. Since J, is a maximal triangular 
algebra, F € J.. Thus we may set F—¢(F), where HE J, and (by Lemma 
1(iv)) is a hull idempotent of J,. Since F is a full set of hull idempotents 
for J,, there is an element HY’ in F¥ having the same range as FL. Let $(L’) 
=F’, Then F’€ ¢(#), and has the same range as Y (by Lemma 1 (iii) ). 
Since the range of F is L, it follows that ¢(#) is a full set of hull idem- 
potents for J >. 


Lemma 3. Let e, f be elements of the Hilbert space Hp, and let S be 
the bounded linear operator acting in Hp which ts defined by the equation 
Sxr=<a,e>f (t€ Hp). Then a necessary and sufficient condition that S 
should be a member of the ordered basis Fp is that there exist a pE[0,1] 
such that FE, e=0, E,f=f. The notations are those introduced in § 2. 

Proof. Suppose that S€ Tp, and let »~—inf{rA: The right 
continuity of #, implies that #,f =f, while from the definition of p it follows 
that AnfAf (A<p). Now 

= || Eye |i? f, 
Hy SE ye = || Eye ||? 
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Since SE, = while when A <p, we deduce that for such A, 


Eye =0. When yp we obtain L,e—0. 
Now suppose conversely that for some p»€[0,1] we have 
E,.f=f. lfiA< yp, then 


= e>f = <2, Lye>f = Hp), 


and thus SE, =F)SE, (A<p). On the other hand, the last equation is 
satisfied when 4 = yp, since then f= )f. Hence S leaves invariant the range 
of each Fy, and is therefore a member of Jp. This completes the proof of 
the lemma. 


Lemma 4. Let J be an ordered basis over a separable Hilbert space H, 
and suppose that AE J. Then A has rank unity tf and only if (i) A. 
and (ii) if B,CE F and BAC =0, then at least one of BA, AC ws zero. 
Consequently, the property of having rank unily is preserved by algebraic 
isomorphism between ordered bases. 


Proof. We first suppose that A has rank unity. Then we may choose 
e, f (€ H) in such a way that Avx=—<cu,e>f (e€ H). We then have 


= <2x,e>Bf, ACr=<2,Cte>f, BACr = <x, C*e>Bf (xe 


If BAC = 0, one at least of C*e, Bf is zero; hence one at least of BA, AC is 
zero. Since A +0, conditions (i) and (ii) are satisfied. 

Now suppose that A has rank exceeding unity. Because of the repre- 
sentation theorem for ordered bases which is stated in § 2, it is sufficient to 
consider the case in which J =@Jp. We shall use the notations introduced 
in § 2, and write J for the range of the projection Fy. Lei 


a—sup{A: Ly, —0}, B=inf{rA: —T}. 


Thus Hoa-—0, Hg—I. We now consider separately four possibilities, 
enumerated below as cases 1 and 2, in each of which there are alternatives 


(a) and (b). 
Case 1. dim{A(Ly))} >1 for some < B. 


In this case let w,, We (€ Ly) be chosen in such a way that Au,, Au, are 
linearly independent, and let e(40)€ Hp OL). We now sub-divide case 1 
into two alternatives. 


Alternative (a). For some p> a, (I—F,)Aw, (I — E,)Auz are linearly 


independent. Let f(~40)€ L,. We may assume (by making a non-singular 


I 
t 
| 
{ 
{ 
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transformation on w,, that (I—FH,)Au,, (I—FH,)Aus are orthogonal. 
We then define 


Cr = Br = <x, (I — Hp). 


It is easily verified by means of Lemma 3 that B,Cé€ Jp; furthermore, 
BAC =0. However, neither BA nor AC is zero, since ACe~0 ~ BAiio. 


Alternative (b). For every p>a, (I—F,)Au, are 
linearly dependent. We may choose scalars 1,, m, (u >) such that 


+ | m, | (I — E,) Au; + m, (I —E,) Au, = 0. 


As » \@ through a suitable sequence, J, and m, tend to limits 1, m (respec- 
tively), and we have 


 1(1—Ea)Au, + m(I— Eq) At, =0. 


It follows that (I — Fa)Au,, (I are linearly dependent, and hence 
that If we now set up + then Ay = EgAuy. Since 
Ea 
the function taking the value unity at « and vanishing elsewhere. Thus the 


E, we may describe Ha as the projection of multiplication by 


single point a has positive p-measure, and Hq is an operator of rank unity. 
Since dim{A (Z))} > 1, we may choose uw; ( € Ly) such that = 0 Aus. 
If we now define B= Eg, Cr = <x, (x€ Hp), then it is easily verified 
that B,C€ Jp. Furthermore, BAC ~0, but ACe 40 BAwW. 


Case 2. supdim{A(L))} =r. 
A<B 


In this case we have dim{A(L,)}—r for some p< B. Then A(Ly) 
=A(L,) (uSA< B), and hence A(Lg_) = A(L,) (where is the range 
of the projection Hg). We deduce that dim{A(LZg)}—r=1, and thus 
that Lg. AHp—Lg. It follows that Lg has codimension unity in Hp. Let 
e(40)€ Hp OLg. Since Hp is spanned by e and Lg, A(Hp) is spanned by 
Ae and A(Lg_). From the inequalities dim{A(Hp)} = 2, dim{A (Lg-)} S1 
we may now deduce that 


dim{A(Hp)}=2, dim{A(Lg.)}=1, Ae¢ A(Lp_). 


Let uo be an element of Lg. such that Au, 0. We now sub-divide case 2 
into two alternatives. 


Alternative (a). For some p> a, (I—F,)Auo, (I— E,) Ae are linearly 
independent. Choose f(~0) € Ly, and u,€ Hp © L,, such that 


<Aé, = (Ao, 
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We then define 
Bz = (2, Uf, Cz = <2, ede Ho), 


and may verify that B,C € Fp, that BAC —0, but that BAA0AAC. 


Alternative (b). For each (I —E,)Ae are linearly 
dependent. We may deduce, by the type of argument used in case 1 alterna- 
tive (b), that (I—E,)Au, (I—Ea)Ae are linearly dependent. Hence 
there is a linear combination u, of e and uy, such that 04 Au, EeAu,; 
and by following the method of case 1 alternative (b), we may construct 
us(€ Hp) such that We define <z, 
(c€ Hp). Once again we may verify that B,C€ Fp, that BAC —0, but 
that BAA0SAAC. 

The four situations considered above are jointly exhaustive, and in each, 
the assumption that A has rank exceeding unity leads to a proof that con- 
dition (ii) of the lemma is not satisfied. Hence the two stated conditions 
are fulfilled if and only if A has rank unity. Since these conditions are 
obviously invariant under algebraic isomorphism, the lemma is proved. 


~ 


Lemma 5. Let J be an ordered basis over a separable Hilbert space H, 
and suppose that TE J. If ST =O whenever SEF and S has rank unity, 
then T =0. 


Proof. It is again sufficient to consider the case in which J = Jp, where 
p is some non-negative measure defined on the Borel subsets of [0,1]. Suppose 
that 740, and choose u( € Hp) such that Tu0. We consider two cases. 
In the following, « is defined as in the proof of Lemma 4. 


Case 1. For some X>a, (I—E,)Tu¥0. Choose f(0) such that 
f=F)f, and define Sr = <x, (I— E,)Tu>f (x€ Hp). Then we may verify 
by means of Lemma 3 that S € Jp. Furthermore, S has rank unity and 
STu +0. 


Case 2. For every X>a, (I—E,)Tu=0. Then (J—F,)Tu=0, 
and thus He~0. If we now set S= Ko, it follows that S is an operator of 
rank unity, and that STu= TuS<0. 

The two cases considered above are jointly exhaustive, and in each of 
them the assumption that 70 leads to the construction of an operator S 
of rank unity such that S€ FJ, and ST=+0. Hence the lemma is proved. 


Lemma 6. Let J,, J, be ordered bases over separable Hilbert spaces H,, 
H, respectively. Let , y be two algebraic isomorphisms from J, onto J» 
such that ¢(S) =y(S) whenever S€ FJ, and S has rank unity. Then 6=y. 


0 
r 
r 


ISOMORPHISMS BETWEEN ORDERED BASES. 469 


Proof. Let Té€J;, and define T7—¢(T)—y(T). Let S’ be any 
operator of rank unity in J,. We may set S’—¢(8), where SE J, and 
(by virtue of Lemma 4) S has rank unity. Hence the operators S, ST have 
rank at most unity, and we have 


¥(S) =9$(S) =8’, 
0=$(ST) —y(ST) 
= $(8)4(T) 
= 8’'{$(T) —y(T)} 
az 


This holds whenever S’€ J, and S’ has rank unity. Lemma 5 now implies 
that = 0, that is, 6(7) =y(T). Since T is an arbitrary element of Ji, 
we have ¢=y. 


4. Diagonal isomorphisms from an ordered basis onto itself. Let J 
be an ordered basis over a separable Hilbert space H, with diagonal @. 
By a diagonal isomorphism on J we shall mean an algebraic isomorphism ¢ 
from J onto itself, such that ¢(A4) =A (AE Q). 


THeEorEM 1. Let J be an ordered basis over a separable Hilbert space 
H, with diagonal C, and let $ be a diagonal isomorphism on J. Then ¢ is 
a spatial tsomorphism, and may be implemented by a non-singular trans- 
formation belonging to the diagonal CQ. 


Remark. For the case in which the ordered basis ¥ has an atomic 


diagonal, this result is contained in Theorem 3.2.3 of [3]. 


Proof. We may suppose without loss of generality that J is the ordered 
basis Jp described in §2. We define 


a—sup{r: p([0,A]) 8 —inf{a: p([A,1]) = 0}. 


It is clear that p([0,a)) —0—p((8,1]). We may ignore the trivial case 
in which the measure p is concentrated on a single point (and Hp is one- 
dimensional), and deduce that 8 >a. By a linear transformation on the 
real numbers we may reduce to the case in which B=1, «0. Thus we 
have H=Hp, J and 


(1) p([0,A]) #0 Ap([A, 1]) (0<A<1), 
Ey 40 AI—By. (0<A<1). 


The remainder of the proof is divided into a number of sections. 
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(a) Proper dissection points. These are the points A(€[0,1]) such 
that both the intervals [0,A], [A,1] have positive p-measure. Under our 
assumptions, every interior point of [0,1] is a proper dissection point: an 
end-point of [0,1] is a proper dissection point if and only if it has positive 
p-measure. Hence the proper dissection points form an interval X such that 


(3) (0,1)CXC [0,1], p(X) =p([0,1]). 


(b) The functions a(t), b(t). When e,f€ Hp, we denote by e@f the 
linear operator S from Hp into itself which is defined by Sx = <a, e>f (x € IT). 
When 7 is a Borel subset of [0,1], we write x, =x,(t) for the characteristic 


function of r. 


Given a proper dissection point A, we may consider the operator 
This operator has rank unity, and it is immediate from 
Lemma 3 that F,€ Jp. It is easily verified that 
p J 


Fy 


By applying the diagonal isomorphism ¢, which leaves invariant the operators 
E), Ey. Cp, we obtain 


(4) = (J — £)-). 
From Lemma 4 it follows that the operator ¢(/) has rank unity; we may set 
$(Fy) =H 


where @), 6, are non-zero elements of Hp, and are determined to within 
scalar factors £, y (respectively) such that &7—1. From (4) it follows that 
(possibly after modifying a, b, on p-null sets) we have 


ay(t) =0 (t¢< A), 
(5) b(t) =0 (t>d). 


Our immediate objective is to show that, if a, and by are suitably normalised, 
then we have 


a(t) =Xpvu(t)a(t), by (t) = (t) b(t) (AEX), 
where a and 6 are fixed p-measurable functions. 


Suppose therefore that »,A€ X and »=A. We have 


OF 
= (Xp @ (I — E,_) 
= FE) Xt0,u1)- 
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By applying the diagonal isomorphism ¢ we obtain 
(a, © by) (I — Ey_) = Ey (a, b,) 
that is 


(6) (Xtu,114.) @ by = dy ® FO. 


We now choose and fix a proper dissection point « If A€ A andA=z, 
we consider equation (6), with w replaced by « If AC X and A=a, we 
consider equation (6), with A, » replaced (respectively) by «, A. We conclude 
that the functions a), b, may be normalized in such a way that, with the 
possible exception of certain sets of p-measure zero, we have 


ay (t) (4) | 


(7) (t) a(t) (AE X,A=2), 
(4) a(t) = da(t) 
(8) 


Having normalised the functions a, b, (A€ X) by means of equations (7), 
| 
(8), we remark that 


with the possible exception of certain p-null sets. The existence of some 
normalisation of the functions a, a,, b), b, for which (9) is valid is an 
immediate consequence of (6). To prove the validity of (9) for the particu- 
lar normalisation given by (7), (8), we note that both a,(¢), a,(¢) are equal 
to da(t) at almost all points of the interval [max(y,A,#),1]: and that the 
three functions do not vanish at almost all points of this interval (since 
otherwise, in view of (5), one of them would be a p-null function, contrary 
to hypothesis). Hence the normalisation for which (9) is valid coincides 
with that defined by (7) and (8). 

We now choose an increasing sequence of closed intervals Xn = [An pnl 


whose union is Y, and define 


a(t) 
b(t) = (OStSyp), 


The functions a(t), b(¢) are defined throughout Y, and hence at almost all 
points of [0,1]. From (9) we deduce that, with the possible exception of 
certain p-null sets, we have 


ay (t) (4), 


5 


(10) 
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Finally, we deduce from the defining property of the functions a, 6), and 
from equations (10), that 


(11) $(XpA,11 @ = (AEX). 


(c) The relation b(t)a(t) =1. Let A,4€X and suppose that A= yp. 
An elementary calculation shows that 


@ (Xtw17 = (LA, ) Xp0,a1)- 


This relation may be re-written in the form 


(Xp Xt0,03) Eu. (I — (Xtu.13 Xt0,u1) 
=p([A, (Xtw11 Xt0,u3)- 


lf we apply the diagonal isomorphism ¢ to the last equation, and use (11), 
we obtain 


(4X Bu. (I — Ey-) (0X 


=p([A, #])#y (4X 
that is 


An elementary calculation shows that the left hand side of (12) is equal to 


Since, as has been verified in the above calculations, 


(4X = $ (Xtu,11 @ FO, 
we deduce that 


Tt follows that b(t)a(t) = 1 at almost all points of XY, and hence at almost 
all points of [0,1]. 


(d) Proof that a, b are p-essentially bounded. Suppose that f,g € Hp, 
that A € X, and that the functions fX;,,1}, 9Xto,,) are not p-null functions. Then 
the operator fX;,,1;® gXto,,; has rank unity, and is a member of Jp. We 
define 


fi=1/(14+|f I), fo=f/(1+|f]), 
ge=—g/(1+]g)). 


f 
a 
I 
t 
| 
( 
{ 
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Then f1, fo, g1, g2 are p-essentially bounded measurable functions. With these 
we associate the corresponding multiplication operators My,,,- - -,M,,, which 
belong to the diagonal (fp of the ordered basis Jp. From the (easily verified) 


formula 


My, (ho ® hs) Mn, (hgho) ® (Ayhs) 


(Ai, hs € L ({0, 1] :p) hs € Hp), 


and the relations ff, =f., gg1=g2, we deduce that 
(13) Mo, 9Xt0,.1) My, = Moe Xt0.11) My 


Let us now operate on equation (13) with the diagonal isomorphism ¢. We 
may suppose that 


9Xt0.11) =F @ G, 


where F', G are non-zero elements of Hp. We obtain 


M,,(F® G) M;, = My, (aXp,13 @ Mp, 


that is 
(f:F) @ = (af @ (bg2Xt0,r1)- 


It follows that F and G may be normalised in such a way that (with the 
possible exception of certain p-null sets) 
= 
= 


Hence 
F = 


G = bgXto,n1- 


We deduce that € Hp, and that 


It is apparent that this equation remains valid when one or both of fxXp,1, 
9Xto,4} 18 a p-null function. 

We now choose a proper dissection point A, and an element f of Hp, 
such that f(t) 40 (OS¢t51). We keep f and A fixed, but allow g to vary 
in Hp. Since ¢ is an algebraic isomorphism from Jp onto itself, and pre- 
serves the property of being of rank unity, we deduce from equation (14) 
that as g runs through Hp, bgXx;o,,; runs through all elements of Hp which 
vanish on the interval (A,1]. It follows that both b and its inverse function 
b* are p-essentially bounded on [0,A]. Similar considerations show that a 
and a are p-essentially bounded on [A,1]. Since bd=1, we may deduce 


that a, a-' are p-essentially bounded on [0,1]. 
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(e) Conclusion of the proof of Theorem 1. Equation (14) may be 
written in the form 


(15) = M, 9Xt0,x1) Ma 
= (fXp,11 9; 


where Q (€ Cp) is the operator Mj. It follows from Lemma 3 that every 
operator of rank unity in Jp can be written in the form fX,),1] @gX¢o,a) for 
some A€ X. From (15) we deduce that ¢ coincides, in its action on operators 
of rank unity, with the spatial isomorphism induced by Q. It now follows 
from Lemma 6 that ¢ coincides with this spatial isomorphism. This com- 
pletes the proof of Thorem 1. 


5. Symmetric isomorphisms between ordered bases. Let J; be an 
ordered basis over a separable Hilbert space H;, with diagonal (; (11.2). 
By a symmetric isomorphism from J, onto J, we shall mean an algebraic 
isomorphism ¢ from J, onto J, such that ¢(A*) —¢(A)* (AE G,). 


THEorEM 2. Let J; be an ordered basis over a separable Hilbert space 
Hj, with diagonal G; (t=1,2), and let ¢ be a symmetric isomorphism from 
J, onto FJ. Then $ is a spatial isomorphism, and may be implemented by 
a transformation of the form (=Q,U, where U is a unitary operator from 
H, onto H, and Q,€ dy. 


Remark. For the case in which each of the ordered bases has atomic 
diagonal, this result is contained in Theorem 3.2.3 of [3]. I am indebted 
to Professor R. V. Kadison for a suggestion which has enabled me to shorten 
substantially the arguments by which Theorem 2 is deduced from Theorem 1. 


Proof. Let A€ G,. Then A,A*€ J, and we have 
Eds, $(A)* = € 


It follows that (A) € and that C Since ¢(@,) is a maximal 
abelian sub-algebra of J2, while @. is abelian, we may deduce that $(,) 
=(,. Thus ¢ induces an adjoint-preserving algebraic isomorphism from Q, 
onto @,; and (; is a maximal abelian self-adjoint sub-algebra of (Hi, H;) 
(t—1,2). It follows (see the uniqueness clause of Theorem 5 of [5]) that 
there is a unitary operator U from H, onto H, such that 


(16) ¢(A) =U-AU (A€ G,). 


We shall now prove that 


(17) 
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For let ¥, be the family of all hull projections in J,;. Then ¥, C C, and 
¥, is a full set of hull idempotents for J,. If we now define 


(18) F2—=¢(F:1) = 


it follows from Lemma 2 that #. is a full set of hull idempotents for J». 
Hence, for i= 1,2, F; consists of all operators S which act in H; and satisfy 
SE=ESE ¥;). From (18) it is immediate that This 
proves (17). 

We now define 


(19) y(S) =U¢(8)U> 


From (16) and (17) we deduce that y is an algebraic isomorphism from J, 
onto itself, and that y(A) =A (A€@Q,). Thus y is a diagonal isomorphism 
from J, onto itself, and (by Thesorem 1) is a spatial isomorphism which 
may be implemented by an operator Q, (€ @,). Hence 


= Q178Q1 (SE J:), 


and 


$(S) = Uy (S)U 
= 
= Q78Q 


where Q9=@,U. This proves Theorem 2. 


6. Algebraic isomorphisms between ordered bases. 


THEOREM 3. Let J; be an ordered basis over a separable Hilbert space 
H, (t=1,2), and let be an algebraic isomorphism from J, onto FJ». 


Then ¢ is a spatial isomorphism. 


Proof. In view of the representation theorem stated in § 2, it is sufficient 
to consider the case in which J, = Jp, where p is some non-negative measure 
defined on the Borel subsets of [0,1]. We shall use the notations introduced 
in §2. The proof is divided into a number of stages. 


(a) Consider the algebra ¢(@p). This is a maximal abelian sub-algebra 
of Jz, and since J» is weakly closed, the same is true of ¢(Cp). A fortiori 
¢(Cp) is uniformly closed and, with operator bound as norm, forms a com- 
mutative Banach algebra. Thus ¢ induces an algebraic isomorphism y from 
the commutative von Neumann algebra (Zp onto the commutative Banach 
algebra ¢((p). It is clear, from the definition 


A has an inverse in J} 


r4 = inf{p: |r| 


|_| 

(SE J1); 
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of the spectral radius rs of an element A in a Banach algebra J, that y 
preserves spectral radius. Now 


ra=||4 | (A € Gp), 
since Cp consists of normal operators; and 
Tya) S| ¥(A)|| (A€ Gp), 


by a well known inequality (see, for example, [2], p. 567, Lemma 4). Thus 


we have 
| A S| |) (A € Gp). 


Hence (considering @p and ¢(Cp) as Banach spaces) y~ is a continuous 
linear operator from ¢(CQp) onto Gp. ‘The inversion theorem (see, for 
example, [2], p. 57, Theorem 2) implies that y is also continuous. We may 
therefore choose a constant K such that 


(20) (A€ Gp). 
(b) Let E(r) be the spectral measure in Cp, and let Hy, Ey be 
defined as in §2. We set 


F(t) 


(21) Py = $(£)), 

Py. = $(£)-) 
for all Borel subsets 7 of [0,1] and all A€[0,1]. It is immediate that 
(22) F,=F([0,A]), Fy-=F([0,A)) 


Furthermore, if +, 71, rz are Borel subsets of [0,1], 7. is the empty set, and 
K is the constant occuring in (20), we have 


(i) F(r)=0, F([0,1])=T7; 
(ii) t2) = F (41) F (72), 
F(7,U 72) =F (71) + F(r2) provided t2 = 1703 
(ii) | SE. 


Conditions (i), (ii) follow from the corresponding properties of H(r), and 
(iii) is seen to be a consequence of (20) and (21) when we recall that 
E(r) € Gp and that || =1. 

We may now deduce from a theorem of Dixmier ([1], p. 222: see also 
[4], [6]) that it is possible to find a Hilbert space Hz, and a one to one 
bicontinuous linear operator P from H,; onto Ho, such that 


(23) G(r) =P-F(r)P 


ISOMORPHISMS BETWEEN ORDERED BASES. AGT 


is a self-adjoint operator for each Borel subset z of [0,1]. It is clear that 


H, is separable. 
(c) We define 


PZ .P, a; = 
We shall prove that J, is an ordered basis with diagonal Cs. 


Since the operators Fy), Hy (OSAZ1) form a full set of hull idem- 
potents for Jp, it follows from Lemma 2 and from equations (21) that F), 
Fy. (0OSA=1) are a full set of hull idempotents for J.. Hence J; 


consists of all operators acting in H, which leave invariant the ranges of the 
idempotents Ff), Fy). (OSAS1). Thus, if we define 


(24) Gy = Gy- = 


then J; consists of all operators acting in H; which leave invariant the ranges 
of the idempotents Gy, Gy (OSAS1). Hence 


(25) {TC(Hy, Hs): GTO, ATH. 1)}. 


From equations (22), (23) and (24) we deduce that G,—G([0,A]), 
G([0,A)). Hence the idempotents G), are projec- 
tions, and in fact form a linearly ordered family of projections. If we now 
prove that the von Neumann algebra @; generated by these projections is a 
maximal abelian sub-algebra of (H;,H;), we shall be able to deduce from 
Theorem 3.1.1 of [3] that J; is an ordered basis with diagonal 63. 

When S€ J>p we define 


(26) ¥(S) = P*4(8)P. 

Thus y is an algebraic isomorphism from Jp onto Js, and 

(27) ¥(Ap) =P*4(Ap)P 

Furthermore, 

(28) = G- 1). 


Let @’, denote the commutant of @;. Then 
6’, ={T€(Hs,Hs): TGQ =QT, (03451)}. 
It is clear from (25), and the preceding line, that @’; C J;; hence 


Th =GQT,TH—GT (0SAS1)} 


= (Gp) 
= 
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Thus @’; is abelian, and contains @; (since G,,G,-€ @;). Since, by the 
double commutant theorem, 6;— 6”;, we deduce that @3;—=(4’;)’ C 6’s. 
Hence 6; It easily follows that (=(;) is a maximal 
abelian sub-algebra of (H;,H;), and hence that J; is an ordered basis with 
diagonal 


(d) From (20) and (26) it follows that y, as a mapping from Cp onto 
G;, is continuous if each of these algebras is considered with its uniform 
topology. Since 


y(E(r)) =G(r), 


we deduce that y is adjoint-preserving on the uniformly closed sub-algebra 
of Gp which is generated by the projections H(+). But this sub-algebra is 
in fact the whole of Gp (as follows from the spectral theorem for normal 
operators, or by elementary arguments based on the fact that Cp is a multi- 


plication algebra). Hence 
y(T*) =y(T)* (TE Gp). 
Thus y is a symmetric isomorphism from Jp onto Js, and, by Theorem 2, 
is spatial. From equation (26) it is now immediate that ¢ is spatial. This 
proves Theorem 3. 
UNIVERSITY OF DURHAM, 


COLLEGE, 
NEWCASTLE UPON TYNE 2, ENGLAND. 
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SOME GLOBAL PROBLEMS IN THE THEORY OF FUNCTIONS 
OF SEVERAL COMPLEX VARIABLES.* 


By Errett BisHop.* 


1. Introduction. In a previous paper [2] the global structure of a 
complex analytic variety was examined in some detail. The present paper 
is a sequel containing some application of those results. In order that this 
paper be self-contained, the definitions and results of [2] are summarized in 
Section 2. 

Consider now an analytic subset LZ of a Stein manifold K. It is a 
consequence of Theorems A and B of [3] that 

I. The set Z is the set of common zeros of some family [ of analytic 
functions on K. 

Il. If Z is a submanifold of K then every analytic function on ZL can 
he extended to an analytic function on K. 


Previous work on Problems I and II was done by Oka [5], [6], and 
others. In Section 3 we give new proofs of I and II which are indepen- 
dent of sheaf theory. The proof of I contains an explicit formula for 
the functions in the class T, and in this respect gives information which 
is not available from the sheaf-theoretic proof. Similarly the proof of II 
contains a specific formula for the extension of a function analytic on L to a 
function analytic on K. This formula makes it clear that II continues to hold 
for analytic functions whose values lie in a Banach space, thus answering a 
question posed by H. Rossi in a conversation with the author. 

In Section 4 we prove that the first cohomology group of the sheaf of 
germs of holomorphic functions on a Stein manifold is trivial. Again this 
result follows from sheaf theory [3|. Previous work was done by Cousin [4], 
Oka [5], [6], [7%], and others. Since it is the purpose of this paper to 
proceed independently of sheaf theory, this result is formulated and proved 
without mention of sheaves or cohomology. As is well-known, the solutions 
to Cousin’s first and second problems are consquences of this result. 

The techniques which we use for solving these problems are all essentially 


similar. Since all of the problems considered have previously been solved, 
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we have not attempted to push the theory to the limit of its generality. 
It should however be clear to someone familiar with the subject that the 
methods employed here are capable of handling the same problems in greater 
generality. These methods, which seem not to afford so powerful a general 
approach to these questions as does the theory of sheaves, nevertheless have 
the advantages of explicitness and simplicity. The gain in explicitness comes 
from the use of explicit formulas to construct the various analytic functions 
which arise. These formulas make it clear, for instance, that all of the above 
mentioned results hold for Banach space valued analytic functions. 


2. Preliminaries. In order not to burden the exposition with too much 
generality (as was done in [2]), we confine our attention to Stein manifolds, 
merely remarking that there are obvious possibilities for generalizing the 
results to one type or another of analytic space. 


Definition 1. A Stein manifold K of dimension n is an n-dimensional 
separable complex analytic manifold K having the following properties. where 
A(K) denotes the set of analytic functions on K 


(a) For each pair (p,q) of distinct points in K there exists f in A(K) 


with f(p) Af(q). 
(b) For each p in K there exist f;,- - -,f, in A(K) which map some 


neighborhood of p homeomorphically onto an open set in n-dimen- 
sional complex affine space C”. 

(c) K is holomorphically convex. This means that for each compact 
subset S of K the set 


S={p: p€ K,| f(p)| Ssup {| f(q)|: 9} for all f in A(K)} 
is compact. 


We now state, for the special case of a Stein manifold, some of the 
definitions and results of [2]. 


Definition 2. Let K be a Stein manifold and L an analytic subset of K. 
Let f:,- - be functions in A(K). Write 


Po= {pe L:|fi(p)| <1,1StSh}. 


Let P be an open and closed subset of P, such that P is a compact subset 
of Z. Then P is called an analytic polyhedron in L defined by f,,° - -, fr. 
The analytic polyhedron P is called special if L has the dimension k at each 
of its points. 
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TueorEM 1. If P is an analytic polyhedron in a homogeneous-dimen- 
sional analytic subset L of a Stein manifold K, if S is a compact subset of P, 
and if T is a compact subset of K at each point of which each of the defining 
functions for P has absolute value less than 1, then there exists a special 
analytic polyhedron Q in L with 


SCQCP 


such that each of the functions defining Q has absolute value less than 1 at 
each point of T. The functions defining Q can be chosen to be polynomials 
in the functions defining P. 

Since it is easy that a homogeneous-dimensional analytic subset L of a 
Stein manifold K can be exhausted by an increasing sequence of analytic 
polyhedra, it follows from Theorem 1 that Z can be exhausted by an increasing 
sequence of special analytic polyhedra. Thus the study of Z can be reduced 
for most purposes to the study of special analytic polyhedra in LZ. It turns 
out that a special analytic ployhedron P can be studied in detail by means 
of the map f of P into d-dimensional complex affine space C%, where 
d==dim LZ and f is defined by 


f(p) = (f:(p)," +> falp)) 


with f,,- - -,fa the functions which occur in the definition of the special 
analytic polyhedron P. Before we state the main results about special analytic 
polyhedra another definition is necessary. 


Definition 3. Let X be a topological space. Let “X be the k-fold 
Cartesian product of X with itself. Define an equivalence relation on *X by 
taking ~ (415° *;Yx) to mean that (y1,- -, yx) is a reordering 
of Then ,X, the k-fold unordered product of X with itself, 
is the set of equivalence classes of *Y under this relation. A set in ;.‘\ is 
called open if and only if its inverse image in “XY with respect to the natural 
map 7: is open. The image under of the element 
in *X will be denoted by {21,° -, 2x}. 


The following three theorems give the fundamental facts about analytic 


polyhedra. 


THEOREM 2. Let P be a special analytic polyhedron in a d-dimensional 
subset L of a Stein manifold K, defined by functions f,,---,fa im A(K). 
Let f be the map 

p— (filp),: fa(p)) 
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of K into C4, so that f(P) CE4, where 
Et = +, 2a): |%|<1,1SiSd}. 


Then there exists an integer A, a continuous map » from £¢ into ,P, and a 
closed nowhere dense subset H of EH such that 


(a) For all z in the points p,,- - -,p, in o(z) = {pi,° pra} are 
mapped by f into z. 

(b) No other points in P are mapped by f into z. 

(c) For all z in #4—H the points p,,- - -, py are distinct. 

THEOREM 3. With the conditions of Theorem 2 let J be the subset of 
P consisting of all regular points p of P such that the functions f,,- - -, fa 
are a coordinate set of functions at each of the points p,,- - +, py of o(f(p)) 
={p.,°°-*,p}. Let g be a continuous function on P which is analytic at 
each point of J. Then g 1s analytic at each regular point of P. If also 
9(p:) =9(p2) whenever p, and pe are points in P for which f =f (pe) 
and if go 1s a function on an open subset U of f-*(H*) such that go(q) = 9(p) 
whenever g€ U, pe P, and f(p) =f(q), then go ts analytic on U and the 
function h on E4 for which g =hof ts analytic on E4. 


THEOREM 4. Let T be a compact subset of a Stein manifold K and U 
a neighborhood of T such that for each p in U—T there exists f in A(K) 
with 
| f(p)| > sup {| |: T}. 


Let C(T) be the uniformly-normed Banach space of all continuous compler- 
valued functions on T. Let & be the closure of A(K) in the space C(T). 
Then the spectrum of the Banach algebra UM is T. That is, to any continuous 
homomorphism of & into the complex numbers corresponds a point po in 


T with (f) =f (po) for all f in 


The proof of Theorem 4 (Lemma 6 of [2]) was for certain special sets 
T, but by means of Theorem 1 one can easily approximate 7 by such special 
sets and thereby obtain Theorem 4 for T. 

In order to study a d-dimensional analytic subset Z of a Stein manifold 
K globally it is sometimes necessary to consider a map of K into C4 whose 
restriction to L is almost proper. This concept is explained in the following 
definition. 


Definition 4. A continuous map f of a topological space X into a 
topological space Y is called almost proper if every component of f-*(S) is 


compact for every compact subset S of Y. 
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The following theorem is easily proved from the fact that an analytic 
subset of a Stein manifold can be exhausted by special analytic polyhedra. 


THeEorREM 5. If L is a d-dimensional analytic subset of a Stein mantfold 
K then K admits an analytic map into C4 whose restriction to L is almost 
proper. The mapping can be chosen to be non-singular on L at each point 
of any 0-dimensional set of regular points of L. 

The final result we shall need guarantees the existence of a function 
fas. Which will be used as a convergence factor for constructions involving 
an almost proper map f of an analytic subset LZ of a Stein manifold. 


THEOREM 6. Let f= (fi,: - +, fa) be an analytic map of a Stein mani- 
fold K into C*, whose restriction to the analytic subset L of K is almost proper. 
Let L have dimension d at each point. Let {S;,} be an increasing sequence 
of compact subsets of L, each of which has finitely many components, the 
union of whose interiors is L. Let a be a function from the positive integers 


to the positive real numbers such that 
= {pe L:| fi(p)| Sa(k), 1 StS d} 
for all k. Let H;, be the union of all those components of F;, which intersect 
S;. Then H;, is compact. For each positive integer k write 
Gy = (Ai Fy) — Ax. 


Let Go=H,. Let {gx} be a sequence of functions in A(K) and {ex} a 
sequence of positive real numbers. Then there exists fa, in A(K) such that 


In addition fa,, can be chosen to have distinct values modulo 2zi at the points 


of any countable subset of K. 


Since it is not clear that Theorem 6 as stated follows from the results 
of [2], we indicate the modifications necessary in the proof of Theorem 8 
of [2] to give Theorem 6. The only point in question is the convergence 
on K (and not just on LZ) of the series 


fas 9k 
k=0 


used to define fz, on p. 238 of [2]. To see that this series can be made to 
converge on K, let D be any compact subset of K, with D—D. We shall 
show that for & sufficiently large the g; can be chosen so that | g.(p)| < 2* 
for all p in D. It will be clear that this can actually be achieved for any 
sequence of such compact sets D which exhausts K, from which the uniform 
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convergence of the series for fg, on all compact subsets of K can be obtained. 
To this end it is sufficiently to show that the function + constructed in the 
proof of Theorem 8 of [2] and used in the construction of g;, can be made 
arbitrarily small on D if & is sufficiently large. That is, we wish to find ; 
in & which is arbitrarily close to 0 on H;,U D and arbitrarily close to 1 on G;,. 

Choose & so large that DO LCH;,. Thus there exists a neighborhood 
U in K of T=H,UG,UD such that H, UG,;,=—F,NU. Clearly G;, is 
open in the set 7. Thus to show that 7 exists with the desired properties 
it is sufficient by Theorem 4 above to show that for each p in U —T there 
exists h in 2 with 

| h(p)| > sup{| h(q)|: € TH. 


If pe UNL then pe L — F;,, so that h can be taken to be one of the functions 
f1,: - *,fa. Thus we need only consider the case pe U—L. It is well 
known, and a new proof will be given below (Theorem 7), that there exists 
an analytic function h, on an arbitrary open relatively compact set VCK such 
that hi(p) =1 and h, vanishes on VM L. Now since D — D there exists h, 
in % with ho(p) =1 and |h,(q)ho(q)| <1 for all g in D. Now if V is 
sufficiently large then by Runge’s theorem for several complex variables (a 
new proof of which will be given below which does not depend on this theorem), 
the function hh. can be uniformly approximated on TU {p} by functions 
h in M, so that h will have the desired property. This completes the proof 
that the desired function + can be found. 


3. Analytic sets in Stein manifolds. We wish to prove Theorems | 
and II of the introduction. We first prove somewhat weaker results, I, and 
II, below, which are sufficient for most applications. 


THEOREM 7. Let L be an analytic subset of a Stein manifold K. Let 
U be an open subset of K whose closure is compact. Let L have the same 
dimension d at each of its points. Then 


I,. For each po in U—L there exists an analytic function h on U, 
defined by (*) below, which vanishes on UN L, with h(po) #0. 

Il,. If in addition L is a submanifold of K then every analytic function 
u on L can be extended to an analytic function i on U, given by 
(a), (d), and (e) below. 


Proof. Since K is a Stein manifold and U is compact there exists an 
analytic polyhedron Q in K with UCQ. Thus Q/NL is an analytic poly- 
hedron in Z. By Theorem 1 there exist functions f,,- - -,f¢ in A(K) defining 


{ 
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an analytic polyhedron P in L with UM LCP and | fi(p)| <1 for p in U 
and 1=i=d. Let f be the map 


of U into E* and let f, be the restriction of f to P. Let A and o respectively 
be the multiplicity and multiplicity function, as defined in Theorem 2, of 
the map f, of P into Consider in U—L and write —f(po). Since 
K is a Stein manifold there exists w in A(K) such that w(po.) ~w(p) for 
all p in w(Z). 

For an arbitrary point q in U let o(f(q¢)) Define the 
function h on U by 


(*) i(g) = (w(a) —w(a)). 


Thus is a polynomial in w whose coefficients aj, 0=7 SA, are symmetric 
functions of the g;. Clearly a; is a continuous function on P which is analytic 
on L at each point of the set J defined in Theorem 3. By Theorem 3 the 
functions a; are analytic on U. It follows that h is analytic on U. Clearly 
h(po) #0 and h vanishes on LO UCP. This proves I. 

To prove IJ,, define f,,- - -,f;, and P as above. Choose 7) with0 1 
such that | fi(p)|<70, 1SiSd, pe U. Choose a real number r with 
Write 

T= {pe 


whenever 0<7r<1. Let po be any point of 7. Since K is a Stein manifold 
there exist h,,- - -,h@ in A(K) whose restrictions to LZ are a coordinate set of 
functions on L at the point p). Thus for all sufficiently small «40 the 
functions 

Gi=fit chi 


are a coordinate set of functions on LZ at the point po. It follows from the 
choice of 7, that | gi(p)| 1 for all p in U if is small enough. 
Now fix with r< 1, <1. Consider the sets 


and 
R,= {pe 4d}. 


If ¢ is small enough then R, and R, are analytic polyhedra in Z defined by 
multiples of the functions g,,- - -,ga with R, CTCR,CP. Let g be the map 
*,ga) of K into C4 and let be the restriction of g to R,. From 
Theorem 2 it follows that the map g of R, into r,/% has a definite multiplicity 


p— (filp),* fa(p)) 
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and multiplicity function Since g,,- --,ga form a coordinate system 
at po, the point p, occurs with multiplicity 1 in the set w(g(po)). Let w be 
any function in A(K) whose value at p, is different from its values at the 
other points of w(g(po)). 

For an arbitrary qg in g-1(r,Z%) let q:,: - -,q, be the points of the set 
w(9(q)) * counted according to multiplicity. Let u be any 
function analytic on Ry. Define the function @ on g*(r,#%) DU by 


(a) = 2 [u(a) (w(q) —w(q;))]. 

i= 
Now is a polynomial in w(q) whose coefficients a)(q),- are 
analytic functions of q at each point g in g*(roH#*%) for which qi,° - -,q, are 


disinct points of Ry. By Theorem 3, the functions a; are analytic on U. 
Therefore a is analytic on U. For gin UN LCR, the point q is one of the 
points g:,° in w(g(q)), say In this case it is clear that 


(b) =u(q)w(q) 
where 


is an analytic function on F,, it being understood that gq, is chosen to equal 4 
for g in R,. Remembering that p, occurs with multiplicity 1 in the set 
»(g(po)) and that w takes on values other than w(p,.) at the other points of 
»(g(po)), we see that w(p.) 

Thus for each p, in T there exists a function w analytic on R, with 
W(po) ~O such that for every analytic function wu on R, there exists an 
analytic function « on U which agrees with ww on UNL. Let B be the 
uniform closure on 7’ of all functions analytic on R, From Theorem 4 we 
see that the spectrum of B is T. Since the function #, which is analytic 
on f,, can be chosen to be non-zero at any given point of 7, there exist a 
finite number #,,° - -,Wy of such functions which have no common zero in 
T. Thus from the general theory of Banach algebras there exist elements 
° *,¢y of B with 
(d) +: —1. 


For each j, 1S jSN, let a; be the function defined by (a) with w 
replaced by w; and u replaced by cju. Thus 


N 
(e) ax Sz, 
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is analytic on U, and for all gq in UN L we have, by (b), (d), and (e) 


= Ju(q) =u(q). 


This completes the proof of I1,. 


Remark. Theorem II, holds in the following greater generality. Let 
V be a complex vector space and W some set of linear functionals on V. 
Define a function wu with values in V to be analytic if ¢°w is analytic for all 
¢ in W. Then Theorem II, holds for analytic functions with values in V. 
We now prove I and II. 


THEoREM 8. Let L be an analytic subset of a Stein manifold Kk. Let 
L have dimension d at each of its points. Then 


I. For each po in K—L there exists an analytic function h on K, 
defined by (*) and (%) below, which vanishes on L but does not 
vanish at po. 

Il. Jf L is a submanifold of K then corresponding to every analytic 
function u on L there exists an analytic function i on K with 
i(q) =u(q) for all gq in L. The function i is given by a con- 
glomeration of formulas which are summarized by * below. 


Proof. By Theorem 5, there exists an analytic map f= (f1,° - -,fa) 
of K into C¢ whose restriction f to L is almost proper. If pé€ K then f*(f(p)) 
is countable and consists of isolated points. Let p.,---,pn,--~- be the 


points of f*(f(p)), each being counted a number of times equal to its 
multiplicity for the mapping f. The multiplicity of a point q for the 
mapping f is defined by considering a special analytic polyhedron Q in L 
containing q defined by multiples of f,,- - -,f¢ and taking the multiplicity 
of q for the analytic polyhedron @; this multiplicity does not depend on the 
choice of Q. 


Let {S;,} be a sequence of compact subsets of L, each of which has 
finitely many components, with S;CS;,, for all & and U,intS,;=L. For 
each positive integer k let H; be the union of those components of f-!(a(k) £4) 
which intersect S;,, where >k-+1 is chosen so that f(S,)C a(k) F4. 
Thus S,;CH,. Write 


Gy, = (7 (a(k) — Hy 


4 


for all positive integers k. Now intH,—=H;,f"(a(k)E%) is an analytic 
polyhedron defined by the functions a(k)-f,,- - -,a(k)“fa. Let Ax be the 


— 
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multiplicity of int H;. Thus for each p in K the number of points in the 
sequence 7,2,‘ *~* belonging to intH; is either A, or 0, depending on 
whether f(p) is in a(k) #4 or not. By Theorem 6, there exists fa,, in A(K) 
such that 

| (p) — (Aner +? +1)| <1 


for all p in G; and all k. Therefore 
(faun(p)) 2 +H? 


for all p in Gy. If po is any point in K —L, we may choose fg,, so that in 


addition 


exp (Po) ) A exp (q) ) 
for all g in f-*(f(po)). Write 


w = exp (— 
so that w€ A(K) and 


| w(p) | S — S 


for all p in G,. Choose ky = 2 with f(T’) C k,.H4, where T is a compact sub- 
set of K to be fixed later. Since {/;,} is an increasing sequence of sets whose 
union is K, there exists for each g in f-*(f(7)) a smallest & with q€ Hy. 
If k > k, then g€ in fact, g € int because f(g) € so that 
0 Gy. C int G,.. If then similarly q€ int H;,. Thus 


7(f(T')) C int H;,U U int Gy. 


Now for each p in K and k=1 define 
(*) hy(p) = 11(1— w(pi) [w(p)]*), 


where the product is taken over all p; in F;, with F, H, and Fy = A, — Hy, 
for k>1. Since at most A; of the p; belong to H; and since w does not 
vanish on K, we see that the functions h; are well-defined. We shall show 
that the product 


(3) h(p) IL 


converges uniformly on all compact subsets of K and defines a function h in 
A(K). Assuming for the moment that this is so, it follows from the fact 
that p = p; for some 1 whenever p€ L that h vanishes on L. It follows from 
the fact that w has values none of which equal w( ) at the points of #1 (f (po) ) 
that h(po) #0. Thus h has ali of the desired properties. It remains to 
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prove the convergence of the product ($) to a function h in A(K). For this 
purpose it is sufficient to consider an open subset U of K whose closure T 
is compact. Fix T to be the compact set considered above. Write 


ho(p) —w(pi) [w(p)]-*), 


where the product is taken over all p; in int H;,. Thus by the above we see 


that for all p in T 
ko 
ho(p) = (p). 


Similarly 


hy(p) = — Lw(p)1*) 


for all p in 7 and k=ko, where the product is taken over all p,; in int G,. 
Now for k= ko, hy is a polynomial in [w(p) ]-* whose coefficients we denote 
by aj;(p). Clearly a; is an analytic function of p for all p for which the 
points p; in G; are distinct. Since int G; is an analytic polyhedron defined 
by a multiple of the functions f,,- - -, fa, it follows from Theorem 3 that h, 
is analytic on f*(f(intG;,))> U. Similarly hy is analytic on U. Thus we 
need only check the uniform convergence of the product 


II hx 
k=ko 
on l’, To do this we may equally well consider the series 


in Gy: | 
k=kg 


since | w| is bounded below on the compact set 7. The sum over p; in G; 
contains at most Az. terms, each bounded by (kAxz,1)~1, and so is at most k-’. 
Thus the series converges absolutely and uniformly on 7. This completes 
the proof of I. 

We turn to the proof of II. Define {S,}, f1,-- -,fa f, Gr, Hi, T, pr, 
* Ags and w as above. For p in L choose p,—p. In addition 
choose f:,° - *,fa to be a coordinate system on L at p for all p in a given 
(-dimensional subset of Z. Also choose w to have distinct values at the 
points of f7(f()). For each positive integer & define the function h; on 
K by 

he(p) = u(pi)e(pi) d (pi) (1— w(p;) [w(p) 


where the product and sum are taken over points p; and p; in H;. Here c is 
a given function in A(Z) and d is a function in A(K) which will be deter- 


9 q 
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mined later. Let UCK have compact closure T. Consider k& so large that 
f(T) Ca(k)E*. Now if f(p) € a(k) then f(p,) =f(p) € for all 1. 
Thus we see that p;€ int H; whenever f(p) € a(k)H4 and p,€ H;. From this it 
is clear that each of the coefficients a;(p) in the expansion of h,(p) in powers 
of [w(p) ]~ defines a continuous function a; on f-!(a(k)E*%) which is analytic 
at every point p of f*(a(k)£*) which has the property that the points ), 
which lie in H;, are distinct. By Theorem 3, a; is analytic on U Cf*(a(k)E*). 
Thus h, is analytic on U for all & sufficiently large. If j > k then 


| hi(p) —h(p)| AB+ CD(E—1), 


where A is the sum 

taken over p; in L —H,, B is the product 

TT = + | | | w(p)|*) 
taken over all p;, C is the sum > taken over p; in Hx, D is the product J] 
taken over p; in H; and E is the product JJ taken over p, in L—H,. 
Thus to show uniform convergence of the sequence {h;} on the set U we 
must show that the terms AB and CD(H—1) converge to 0 as kx. We 
consider only the term AB, since the treatment of the other term is entirely 
similar. Now the product B is finite and bounded independently of p if 


p€U, as may be seen by considering the sum 


b> | w (pj) |; 


where the factors |w(p)|-* have been left out because | w| is bounded 
independently of p in U. Thus to show that the sequence {h;} converges 
uniformly to 0 on U as k—oo we need only show that A—0 uniformly on U 
as k-—»o. This will be done by an appropriate choice of the function d. Let 


Bi =sup{| u(p)c(p)|: pe Gy}. 
By Theorem 5 there exists f’g,, in A(K) such that 
| (Pp) <1 


for all p in Gy, so that Let d—=exp(—f'as), so 
that d€ A(K) and d vanishes nowhere on K. Also 


| 4(p)| < 
for all p in G,. Now if p€ U and k, is chosen so large that f(T) C k,.H4 
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each p; in L—H;,, is in some G; for k>k,). Therefore the sum A splits 
into partial sums A, taken over the sets G, for k= ky. Now at most Ax. of 
the points p; lie in G,. It follows from the choice of d that 


A, = (hk? = k-?, 


Thus A= >k°. Hence {hx} converges uniformly on U. It follows that 


{hy} converges uniformly on compact subsets of K to a function @ in A(K). 
For p in L we have, remembering that in this case p, is chosen to be p, 


hi (p) (1— w(p;) [w(p)]*), 


where the product is taken over all p; in H;. Thus, passing to the limit, 


a(p) =u(p)e(p)d(p)w(p), 


where 


w(p) -1 (1—w(p,) [w(p) 


By the choice of f1,- - +, fa, ppAp for j=2 if p—p. By the choice of w 
it follows that w(p) ~0. 

We have thus found a function # = w, on L such that for any c, in A(L) 
there exists d, in A(K) vanishing nowhere such that there is a function @, 
in A(K) which equals uwc,d,w, on L. We may choose w, not to vanish at a 
0-dimensional subset of Z which contains points in each component of L. 
Thus the set Z, of zeros of w, is of dimension at most d—1. Now choose 
2 to have the same properties as w, and to be non zero at a 0-dimensional 
subset of L, containing points in each analytic component of Z,. Continuing 
in this fashion we find #,,- - -,Wq having the above properties and having 
no common zero on L. By a theorem of Arens [1] ? it follows that there exist 
C1,° in A(L) with 

> cw; = 1. 


By the choice of w;, there exists d; in A(K) vanishing nowhere on A(K) 


such that wed; has an extension from Z to an analytic function % on K. 
Write 


* The possibility of using Arens’ theorem here was pointed out by Kenneth Hoffman. 


ad 
i= > (d;) 
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so that 7€ A(K). For p in L we have 


—u(p) —u(p). 


This completes the proof of II. The final formula reads 


Remark. As is well-known, II (in fact II,) can be used to prove Runge’s 
theorem for Stein manifolds. The proof goes as follows. Let T be a compact 
subset of a Stein manifold K and h a function analytic in some neighborhood 
U of T. Runge’s theorem says that h can be uniformly approximated on T 
by functions in A(K) if for each p in U—T there exists in A(K) with 
'g(p)| > max {| 9(q)|: T}. To show this notice that the assumption 
implies that there exists an analytic polyhedron P in K with 


Let - be the functions defining P. Choose -, fn in A(K) to 
separate points of U and to have rank equal to the dimension of K at each 
point of U. We may assume that | fi(p)| <1 for all pin T, k+1SiSn. 
Thus the map f = (f1,: *,fn) of K into C" maps T into Also M=—f(P) 
is a submanifold of #”". Let 7 be the analytic homeomorphism of M onto P 
which is inverse to f, and let wu be the analytic function ho? on L. By II, 
wu admits an analytic extension i to H”. By a well-known and elementary 
use of the Cauchy integral formula, @ can be uniformly approximated on 
f(T) by functions v analytic on #”. Thus the functions vof in A(K) 
uniformly approximate h on T, as was desired. 


4, The resolution of Cousin distributions. As is well-known, the solu- 
tion on a Stein manifold of Cousin’s first problem, which consists of the 
global construction of a meromorphic function whose principal parts are 
locally given, can be obtained from the fact that on a Stein manifold the 
sheaf of germs of analytic functions has a first cohomology group which is 
trivial. Cousin’s second problem, which states that a globally defined mero- 
morphic function exists whose quotients with certain locally given mero- 
morphic functions are analytic functions without zeros, can also be solved 
using this statement about cohomology. Thus we shall restrict ourselves in 
this section to showing that the cohomology group in question is trivial. This 


will be done without the use of cohomological terminology. 
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Definition 5. Let U be an open subset of a Stein manifold K. A 
Cousin distribution on U will consist of a covering {Ua} of U by open sets 
and an assignment for each « and £ of an analytic function fag to the inter- 
section Ua M Ug such that 


(a) fag + = 9 
and 
(b) fog t+feytfye=0 on UanUgnUy. 


for all a, 8, and y. By a resolution of the given distribution {Ua}, {fag} on 
an open set S C K we mean a family {fa}, where fa is analytic on SN Ua, 
such that 

fog = fa — fp on SNUaNnU 


for all « and £. 


By a refinement of the distribution {Ua}, {fag} we mean a Cousin dis- 
tribution {V1}, {g ys} and a mapping o of the index set of {V.} into the 
index set of {Ua} such that 


(c) Vy C 
and 
(d) = on V5 


for all y and 8. If T is a compact subset of K then a Cousin distribution 
on 7 means a Cousin distribution on some neighborhood U of T, and a 
resolution on 7’ means a resolution on some neighborhood of T. The same 
definitions will be taken to hold if the functions fag etcetera are replaced by 
polynomials of given degree in an indeterminate X whose coefficients are 
analytic functions on the appropriate sets. 

We make use below of the trivial fact that a Cousin distribution which 
has a refinement with a resolution itself has a resolution. 

The cohomology theorem mentioned above is just the statement that 
every set of Cousin data on a Stein manifold K has a resolution on K, and 
it is this fact which will be proved below. As a tool we shall use the fact 
that every Cousin distribution on the subset #” of C” has a resolution. This 
is well-known to be a completely elementary fact and indeed was essentially 
proved by Cousin [4]. For completeness however we sketch the proof at the 
end of this section. The following lemma contains the essence of the 
theorem to be proved. 
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Lemma 1. Let T be a compact set of a Stein manifold K and A a 
neighborhood of T such that for all p in A—T there exists f in A(K) with 


f(p) > sup {| f(q)|: T}. 


Then any Cousin distribution on T has a resolution on the wntervor M of T. 

Proof. Let the given Cousin data {Ua}, {fag} be defined on an open 
set UDT. Let P be a special analytic polyhedron defined by functions 
in A(K) with TC PCPCU. Choose a covering {V,} of 
so fine that each component of W,—P 1 f-*(V,,) lies in one of the sets Ua 
for each y. Since P is a special analytic polyhedron, each W., has only a 
finite number of components, which we denote by W,7,- - -, Wz’, where ¢ 
depends on y. For each of these components W;7 choose Uagy in {Ua} 
with Wi? C Uaiy). For each y and 8 define the analytic function hs on 
Ws by 

hys(p) = 


for all p in Wi7N W;*. It is easy to see that {W,}, {hs} is a Cousin dis- 
tribution on P. For each point z in £4 let p,,- --,p, be the points of 
w(z) ={pi1,- > -,p,}. Let w be any function in A(K). Let X be an 
indeterminate. For each y and 6 and each z in V,/ Vs define 


(2, X) hoo ( Ps) i (X — w(pj)). 


By Theorem 3 we see that the coefficient of each power of X in the poly- 
nomial gs in X is analytic on Vy V5. Since {Wy}, {hs} is a Cousin 
distribution on P we see that 
+ = 0 
and 
95 + Jyo + Joy =0 


for all y, 8, and o. Thus {V,}, {gs} is a Cousin distribution for H%. By the 
statement made above (whose proof will be outlined below), there exists a 
resolution {g,} of this Cousin distribution. Now {W,7} is a covering of P. 
For each define the function Fj, an W,7 by 


Fiy(p) =9y(f(p), w(p)). 


These functions are the resolution of a Cousin distribution {W,7}, {Fi,j5} on 
P, where Fis is the function 


Fiy(p) — Fi5(p) = 
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on If for each p in P we write w(f(p)) = pa}, with 
pi =p, it follows that 


= hys(p) Il (w(p) —w(p;)) 
= W(p)hys(p) = W(p) (P); 


where w is the analytic function on P defined by 


=I (w(p) 


Thus {F;,j;5} is a resolution on P, and therefore on M, of the distribution 
Since this distribution is a refinement on M of the 
distribution {Ua}, {Wfag}, it follows that the latter distribution has a resolu- 
tion on M. 

If po is any point in T we may assume that f,,- - -,f@ were chosen so 
that f has multiplicity 1 at p, and that w was chosen to distinguish py from 
the other points of w(f(po)), so that w(po) #0. Thus we can find a finite 
number of functions #,,: - -,Wn, each analytic in some neighborhood of 7 
and having no common zero on 7’, such that for 1 Si 7 the distribution 
{Ua}, {Wifeg} has a resolution on M. Thus for each c; analytic on M the 
distribution {Ua}, {Wicifag} has a resolution on M. By Theorem 4, the ¢; 
may be chosen ith It follows that distribution {Ua}, {fog}, which 
on M is therefore the sum of distributions which have resolutions on M, has 
a resolution on M. This completes the proof. 


THEOREM 9. Any Cousin distribution on a Stein manifold K has a 
resolution on K. 


Proof. Since K is Stein, there exists an increasing sequence {M,} of 
open sets whose union is K and whose closures 7, all have the property of 
the set T of Lemma 1. Thus for each n by Lemma 1 the given Cousin 
distribution {Ua}, {fag} has a resolution {fa"} on M,. We may assume that 
the closure of each Ua is compact. Let {S,} be an increasing sequence of 
compact sets whose union is K, which have the property of the set 7’ in 
Lemma 1, and which satisfy the inclusion S,C M/,. We show now by induction 
on n that for each n there exists a resolution {he"} of {Ua}, {fap} on Mn 
such that 


| ha”*(p) —ha"(p)| <2 


for all p in S,,. Assume that the he” have been found for all integers 


U 
| 
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smaller than a given integer n. For each p in My, choose a with p in U. 


and write 
9o(p) =ha"*(p) —fa"(p). 


It is easily seen that g, is an analytic function on M,.. By Runge’s Theorem 
there exists g in A(K) with 


| 9(p) —go(p)| <2" 


for all p in S,4. It follows that the functions {ha"} have the required 
properties, where he" = fe" +g on Mn. Since the closure of each Ux 
is compact, it follows that for each fixed « the sequence {ha"} converges 
uniformly on Ue to a function {fa}. These functions clearly form a resolu- 
tion on K of the given Cousin distribution. This completes the proof. 

Remark. If a meromorphic function with values in a Banach space is 
defined to be one which is locally a quotient of an analytic Banach-space 
valued function by an analytic scalar-valued function, then Cousin’s first 
and second problems can be solved for such functions by applying Theorem 9 
for Banach-space valued Cousin distributions. 

To finish we indicate the technique for obtaning a resolution of a Cousin 
distribution on £. Let D be the set 


€ + ign, | | << 1,|9.|< 1,152" d}. 


Since D¢ is conformally equivalent to #4, we may equally well consider the 
resolution of a Cousin distribution given on D¢. In fact we may assume 
that the distribution is given on the closure H4 of D4, as the proof of Theorem 
9 shows. Let the given distribution be {Ua}, {fag}, given on H*. We first 
introduce the notion of a soft resolution for a Cousin distribution. This is 
the same as a resolution except that the functions fz are permitted to be the 
arbitrary (not necessarily analytic) functions. It is easy to see that every 
Cousin distribution has a soft resolution. Let {F}a be a soft resolution of 
the given distribution. If U is an open set then it is easy to check that the 
given distribution has a resolution on U if and only if there exist a function 
F on U such that F — F, is analytic on Ua2N U for all Let 2, + 
1=n=d, be the coordinates in C*. Let N be a positive integer and write 
S,(k) = {z€ H4: —1 
N N 
for 1=kS2N and 1=n2d. For each choice of with 
1=k;S 2N, write 


FUNCTIONS OF SEVERAL COMPLEX VARIABLES. 497 


Choose NV so large that the covering {S(ki,- - -,k2a)} of H4 is a refinement 
of the covering {Ua} of For each -,k, with 1k; and 
0StS 2d, let 


so that S() = H*. We shall show that the given Cousin distribution has 


a resolution on each S(k,,- - -,k;). This is true if ¢ = 2d because the covering 
{(S(ki,° +, is a refinement of {Ua}. Assume therefore that it is true 
for all ¢ greater than some given ¢, so that we must show that there is a 
resolution on each S(k,,- - -,k:). To this end write for each 1=k=2N 


k k 
j= = 


S(k,,- 1) =§(k,,- ‘,ki3k) U ‘, +1) 


By the induction hypothesis on ¢, it follows that the given distribution has a 
resolution on S(k,,- - -,4:31). We now show by induction on & (¢ remaining 
fixed) that the given distribution has a resolution on -,k:3k). 
Assume therefore that this is true for all & smaller than the given k. Thus 
by the above we see that there exists a neighborhood V, of S(ki,- + -,k3k—1) 
and a function G, on V, such that G,—Fo is analytic on V,;N Ug for 
all a Similarly, there exists a function G@, on some neighborhood V, of 
S(ky,- such that G.— Fe is analytic on V.N Ug for all « Thus 
(,— G» is analytic on V,N V.. It is well known and easy to see that there- 
fore there exist open sets W, and W, with 
+, kes k—1) CW,CV, 
and 


S(ky,- CW.CV: 


and analytic functions A, and A, on W, and W, respectively such that A,— A, 
and G,—G, are equal on W,W,. It follows that the function B on 
W,U W, defined as A, + G, on W, and A,+ G, on W, has the property that 
B— F, is analvtic on (W,U W.)M Use for all a Thus the given distribution 


has a resolution on 
This completes the induction on k. It follows that there exists a resolution on 


| 
Thus 

and 
) 

| 

| 
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This in turn completes the induction on ¢. It follows that there exists a 
resolution on S(@) —H4, as was to be proved. 


Added in proof: Let LZ be an analytic submanifod of dimension d in a 
Stein manifold K. Let U be an open relatively compact subset of K. Then 
the contruction used in the proof of Theorem II, shows that there exists a 
multiple-valued projection of U into L in the following sense. There exists 
an integer A and an analytic map 


of U into the A-fold unordered product ,Q of the space O= L & (' with itself 
having the following properties. For each p in U, 


For each p in LOU and each i with 171A for which pj~p we have 
2:=0. This fact in turn clearly implies Theorem IJ,, since uw can be extended 
from LZ to U by the formula 


r 
u(p) (pi). 


Similarly the proof of Theorem IT can be used to show the existence of 
certain multiple-valued analytic projections of K into L. 
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ON INDUCED REPRESENTATIONS II: INFINITESIMAL 
INDUCTION.* 


By Ropert J. BLATTNER.* 


1. Introduction. This paper is a continuation of the investigation 
begun in [1].? We take as our point of departure some observation of 
relfand and Graev in [3] concerning the appearance of analyticity restrictions 
on the functions upon which the non-degenerate principal series of representa- 
tions of SL(n,R) act. Their argument runs as follows: Let K(n,C) be 
the subgroup of SL(n,C) consisting of the superdiagonal matrices of deter- 
minant unity. The non-degenerate principal series of representations of 
SL(n,C) is realized in a space of functions defined on the right coset space 
SL(n,€)/K(n,C), or alternately in a space of functions defined on SL(n, C) 
which are (left) invariant under the action of K(n,C). Now the non- 
degenerate principal series of SZ(n,R) may be thought of as acting in spaces 
of functions on SZ(n,R) which are invariant under the (left) action of a 
subgroup of SL(n,C) of the form tK(n,C)a*, x€ SL(n,C). This state- 
ment is to be interpreted as honest invariance under SL(n,R)N aK (n,€)2* 
together with differential left invariance under the Lie algebra of «K (n,C) x", 
regarded as a complex subalgebra of the complexification of the Lie algebra 
of SL(n,R). If the functions are taken to be defined on 


mM = SL(n,R)/[SL(n,R)N cK (n, C) 2x], 


then invariance under the action of the Lie algebra goes over into analyticity 
on certain complex submanifolds of %n. 

Now it is known that the non-degenerate principal series of SL(n,C) 
consists exactly of the representations of SL(n,C) induced by characters of 
K(n,€) in the sense of Mackey [5]. It is therefore natural to ask for a 
generalization of the notion of induced representation which for the case at 
hand gives the non-degenerate principal series of SL(n,R). Our generaliza- 
tion has as its setting a Lie group G with Lie algebra g, a complex subalgebra n 
of the complexification g, of g, a closed subgroup T of G whose Lie algebra 


* Received January 13, 1961. 
* Research supported by the National Science Foundation under grant NSF-G12165. 
? Numbers in brackets refer to the bibliography at the end of the paper. 
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is gM n, a unitary representation L of © on a finite dimensional Hilbert space 
¥%, and a representation p of n on &. Certain restrictions are placed on 1 
and p (for exact statements, see Section 3). The corresponding induced 
representation U“-" is a subrepresentation of the representation U” (see [1] 
for the definition). Here too analyticity restrictions appear on the functions 
upon which U“ operates (Proposition 2). Finally, the representations of 
Gelfand and Graev are of the form we propose, except for certain degenerate 
cases. 

Section 2 develops the necessary preliminary material on manifolds with 
some real and some complex parameters. Theorem 1 gives us a way of pro- 
ducing such manifolds. In Section 3, we introduce our induced representations. 
The problem of calculating intertwining numbers is the subject of Section 4. 
The basic result is Theorem 3 and is based on the intertwining number theorem 
in [1]. (Applications of Theorem 3 to the various series of representations 
of the real semi-simple groups is reserved to another paper.) In Section 5, 
additional restrictions on the complex subalgebras involved allows us to reduce 
the intertwining number problem to one of solving certain differential equa- 
tions, while Theorem 4 gives us effective irreducibility and disjointness criteria 
for the representations arising from this restricted class of subalgebras. 
Finally, in Section 6, we consider two examples: 


(a) Harish-Chandra has also considered the problem of generalizing the 
notion of induced representation in case G@ is a semisimple Lie group and T 
is a special sort of Cartan subgroup. His results may be found in [4]. 
Our irreducibility and inequivalence criteria are applied to his representations. 


(b) In [3], Gelfand and Graev mention a series of representations of 
SL(2k,R) which are not in the non-degenerate principal series. Once again 
our irreducibility and inequivalence criteria apply. 


2. Partially complex manifolds. We begin by laying down some con- 
ventions and recalling some facts. 


Vector spaces. If ¥ is a complex vector space, then YW, will denote the 
real vector space obtained from Y%& by restricting the field of scalars. If W 
is a real vector space, then Y, is the complexification of YW, and YW will be 
regarded as a real subspace of W,,. In Y&,, + will denote the unique conju- 
gate linear isomorphism which =—TJ on &. if W is a linear map of the 
real vector space ¥ into the real vector space WW, then ©, will denote the 
usual extension of ¥ to a complex linear map of &, into W.. If BW is a 
complex vector space, then x is the unique complex linear map of W,, onto W 
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which leaves every point of W,—W® fixed. If B and W are complex vector 
spaces and if W is a linear map from &%, into %,, then W is a complex linear 
if and only if ©. carries ker(x) into ker(«). 


Differentiable Manifolds. All functions and vector fields will be assumed 
to have open domains unless otherwise stated. Let 9% be a manifold. If 
p€ M, then 9%, will denote the tangent space to In at p. If f is a differ- 
entiable function from 9n to a complex vector space, if p€ dom(f), and if 
Xp€ My, then Y,f is defined in the obvious way. A complex distribution 


of dimension 7 on 9 is a function m from 9% to LJ My- such that my, (the 
pem 


value of m at p) is an r-dimensional complex subspace of IMp-; the properties 
of being analytic, involutive, etc., are defined in the obvious way. If ¢ is a 
differentiable map of 9% into another differentiable manifold, then ¢, will 
denote the induced map of tangent spaces. 


Complex manifolds. Let In be equipped with a complex structure. Then 
n,° will denote 9%, equipped with the canonically induced complex linear 
structure. To each p€ 9n we assign c, the kernel of « in Mpc (= Mp%re). 
¢ is a complex C® involutive distribution called the canonical distribution of IM. 
If ¢ is a differentiable map of 9h into another complex manifold, then ¢ is 
called holomorphic if $, is complex linear. This is equivalent to demanding 
that dx- take cp into ¢g(p) for all p€ dom(¢). If f is a function from In 
to a complex vector space, then f is holomorphic in the above sense if and 
only if cpf 0 for all p€ dom(f) (Cauchy-Riemann equations). If X is a 
vector field with values in M,y., p€ M, then X will be called holomorphic if 
Xf is holomorphic for all holomorphic functions with values in C. 


Mixed spaces. The canonical projection of C” R" onto (resp. R") 
will be denoted @ (resp. p). If «€ Rk”, then @, will denote the map z—(z, x) 
of C” into « R". 

Our basic notion, that of a partially complex structure on a differentiable 
manifold, is to the mixed space C” & R" as the notion of complex structure 
is to the space €”. 


Definition. Let IM be a real CY manifold of dimension n. A partially 
complex structure of dimension r (27 =n) on YM is a maximal collection @ 


of maps satisfying: 


(1) each ¢€ @ is a CY” diffeomorphism of an open set in 9 with an 
open set in 
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(2) the domains of the ¢€ @ cover MN; 


(3) if @ and then pogoy depends locally only on 
Rr and is holomorphic. 


Remark. As usual, if @, is any collection satisfying (1-3), then @, is 
contained in a unique maximal collection @ satisfying (1-3). Hence to 
define such a structure, we may ignore the maximality condition. 

An open set O in 9 is called cubical if there is a coordinate function 
¢€ 6 with domain O whose image is a set of the form [(21,---,2,21,° °° 
Ln-or): | %| <a,| | <a] for some fixed a > 0. 

We introduce a complex distribution m as follows: if 6€ @ and p€ dom ¢, 
then mp—ker[ (p°d) xc] 1(0°¢) xc? (Cogip)))- It follows from condition 
(3) above that mp, is independent of the choice of ¢. From the way m looks 
under ¢,- for a fixed ¢€ @, it is clear that m is involutive, that mM m* = {0}, 
and that m-+m* is involutive, being in fact the complexification of the 
tangent distribution to slices of the form [p€ dom ¢: p(¢(p)) =constant |]. 
Each maximal integral manifold of m-+m* is a complex manifold in the 
obvious fashion and will be called a canonical complex submanifold of 


THEOREM 1. Let In be a real C® manifold of dimension n. Let m be 
an involutive complex distribution r(2r=n) on MM such that mN m* = {0} 
and m + m* is involutive. Then M bears a unique partially complex structure 
6 of dimension r such that m is the canonical distrbution of (MM, 6). 


Proof. Let p€ Mm. By the complex Frobenius Theorem there exists 
complex valued C’ functions f:, - -.fn defined in an open neighborhood V 
of p such that for all g¢€N the annihilators in Mg of {f1,- - -,fn}, 
* fn}, and fn} are {0}, mg, and respectively. 
Without loss of generality we may assume that N is a cubical neighborhood 
of breadth a about p with real coordinate functions g;,: - -,g, such that slices 
of the form gi(p) = &, | & | <a, 1=2r+1,---,n are integral manifolds 
of m+m*. Set 6= (fri X for) X (Gers Gn), a map of 
N into is regular. In fact, if ¢.(X,) =0, M,, GEN, 
then so that X,f,—0 for i—r+1,---,n. Hence m, 
MO M,— {0}. Shrinking WN if necessary, we may assume that ¢ is a diffeo- 
morphism of V with an open subset of €” > R"-*r, 

Let 6, be the collection of all ¢’s obtained in this fashion for arbitrary 
p€ Mm. Properties 1 and 2 in the definition are true by construction. -\s 
for property 3, let ¢,y€ @o. Let y€ dom(p° poy") and let N, be a neighbor- 
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hood of y*(y) such that y(No) is a cube Cdom(pogoy*). Then slices of 
N, of the form [p€ No: p(¥(p)) = constant] are integral manifolds of m+ m* 
contained in dom¢. Therefore p°¢ is constant on these slices, and this 
implies that po doy restricted to y(N,) depends only on Now let 
set and let z€domh. Observe that x 
maps ¢; univalently onto my-12,.) and that (6°¢), maps this space in turn 
univalently onto cy :). Since hy, maps c; into ¢n;:, we conclude that h is 
holomorphic. 

If @ is the partially complex structure on 9m determined by 6, it is 
clear that m is the canonical distribution of (9%, @). As for uniqueness, 
let @’ be any partially complex structure satisfying the conclusion of the 
theorem and let ¢€ @>. The arguments of the last paragraph show that 
6’ U {¢} also satisfy the theorem. By maximality we conclude that 6 = @’. 


3. The induced representations. For any unexplained notation con- 
cerning induced representations the reader is referred to [1]. Let G@ be a 
Lie group with left invariant Lie algebra g and modular functions A. If 
Y€g and x(.) is the one-parameter subgroup of G lying beneath XY, we 
have (Xf) (y) = Dif (yx(t))|+=0 for all differentiable functions f. We define 
a mapping ’ of the family of differentiable functions onto itself by 
f(x) =f(a"). Clearly ’ is involutive. If X is a differential operator on 
with C® coefficients, we define YU by (Xf’)’ for all C* functions f. 
4 is clearly an involutive automorphism of the algebra of all C* everywhere 
defined differential operators on G. If XY €g and x(.) lies beneath X, we 
have (y) —— Dif (a(t)y) | 

Let T be a closed subgroup of G with Lie algebra ) and modular func- 
tion 6. Let n be a complex subalgebra of g, such that nN n*—bh,, 
(adP)n—n, and is a Lie algebra. Let =G|T and let be the 
canonical projection of G onto I. 


PRoposITION 1. zy(nU,) depends only on x(x). Moreover, if we set 
Mp = 7% (nU,) where r(x) =p, then m is the canonical distribution of a 
unique partially complex structure @ on M. 


Proof. Let X € g and x(.) be the one-parameter subgroup of G@ lying 
beneath Y. Let f be a differentiable function on 9. Then 


(XU ¢,) f = (f = — Dif (x(t) ég)) | 
= — Dif — (ad f 
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for €€T, g€G. Since (ad!)n—n, we conclude the truth of our first 
assertion. 

Since the kernel of +, |n4, is just = nz M n2*, we see that mM m* = {0}. 
If x is a local cross-section of G over 9, then a (mp) = nNUOe(p) and 


+ = + (p)* = (1 + n*) Oe 


for all p€ dom(a). It follows from [2], Ch. III, §5, Proposition 2, that m 
and m-+m* are involuntary. Our second assertion now follows from 
Theorem 1. 

Our induced representations are obtained as follows: Let L be a unitary 
representation of [T on the finite dimensional Hilbert space ¥ and set 
M(é) =8(€)4A(€)4L(E). Let p be a representation of n on ¥ such that 
p|h = 4M and p(adéX) = M(é)p(X)M(E)7 for€€ Land X€n. As in [1], 
form the space % of functions from G to 8 determined by fT and L. This 
paper, we shall denote this space by %” to show its dependence on L. Let 
/" be the linear space of all f €%” such that 


(1) the restriction of f to any integral manifold of n+ nO is of 
class and 

(2) [AU+ p(X) for all YE n. 

It is possible that 3”? {0}. In any case, condition (2) is automatically 
satisfied for Y € h, by any fe 3”. It is easy to see that U” sends §”” into 
itself. The restriction of U” to %”" (or rather to §”, the corresponding 
pre-Hilbert space) is called the representation of G induced by L and p. 
It is denoted 

In order to show that §”" is complete, we shall characterize the fe 34’ 
in terms of our partially complex structure @ on Mn. By a charting function 
for L and p will be meant a C’ function H with values in GL(%) such that: 

(1) dom(H)—7"'(O), where O is some relatively compact cubical 
neighborhood in (9%, @) over which is defined a C® cross-section in G; 


(2) H(ér) =M(é)H (a) for €€T, and 
(3) [XO+ p(X)]H =0 on (0) for all X€n. 


Lemma 1. Let po€ M. There exists a charting function H such that 
Po € (dom (#7) ). 


Proof. Let O, be a relatively compact cubical neighborhood of p, over 
which a C® cross-section « into @ is defined. Then the map o: T X O, into 
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G given by o(é, p) =€a(p) is a diffeomorphism T X with w*(0,). 
Now equations (3) are C’, non-singular, and satisfy the (complex) condition 
of Frobenius for local integrability. Hence there is a neighborhood N of 
a(po) for which a C’ function H, from N to GL(%) is defined satisfying 
(3). Moreover, we may assume that VN =w(U XO), where U is a cubical 
neighborhood of e in T and O is a cubical neighborhood of po. We 
define H on r1(O) by H(éa(p)) = M(é)Hi(a(p)) for all €€T and pe O. 
Note that H coincides with H, on N since (3) implies (2) whenever 2 and 
ér€ N, €€T (connectivity implies €€ T)). H satisfies (2) by construction 
and satisfies (3) in N. Now if 1 € g and xr(.) is the 1-parameter subgroup 
of G lying beneath , we have 


(XOH) (ég) =— D,[H (x(t) ég) | 
= — | = M(é) [(ade*X) (9), 


for €€T, g€ Moreover 
p(X) H (ég) = M(€)M(é)p(X) M(€)H(g) = (é)p(ad H(g) 


for XY€n. Since (adT)(n)—n, we conclude that H satisfies (3) in 
rN =-71(0). 

Fix a charting function H with domain 71(0). Let f be a measurable 
function from 71(O) to ¥%. f satisfies condition (1) in the definition of 
wv’ if and only if H-'f does. Moreover 


[XO + p(X) ]f p(X) ] 
({XO+ p(X) 1H) (Hf) + H[XO(H™f) | = | 


for all YX €u. Hence f satisfies (2) in the definition of 3” if and only if 
=0 for all YE un. Now if f€ 3”, then is a measurable func- 
tion (0) to 8 which is constant on the right [T cosets of G. Thus 
H-*f ="f om, where “f is a unique measurable function from O to ¥. It is 
easily see that H-'f satisfies condition (1) in the definition of %7" if and 
only if “f is of class CY on the integral manifolds of m-+m*. Moreover 
nO(H-'f) if and only if m’f—0. We have proved 


Proposition 2. Let D be a collection of charting functions for L and 
p such that G=U|dom(H): HED]. Let Then fe if and 
only if 4f is a complex analytic function on the canonical complex submani- 
folds of (I, B) for all He D. 

Let H be a charting function for L and p. Let ¢ be a coordinate map 
in @ with domain z(dom(H7)) and image O. Let & be a cross-section of G 
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over r(dom(H)) and let B=aog™*. Let w be the homeomorphism of T x O 
onto dom(H) given by w(é,p) =€B(p). Let be the measure on 
induced under w» by right Haar measure on G, and let y be the density of p 
with respect to the cartesian product of right Haar measure on I by Lebesgue 
measure on O. We know that y(é,p) =8(é)7*A(é)y(e,p) for €€T and 
p€O[1]. Finally, for each compact K CO let 


Min[y(e, p) || H(8(p))* pe 


Let f€ 4% and let K be a compact set in O. Choose h€ C*,(dom(#) ) 
such that OS7h=1 and rh=10n K. Then 


f ||? dps = J. f(z) ||? h(x) dx 
Sel H (x)¥f (x(x) ) ||? h(x) de 


f IF dp 


because || H(éB(p))-* y(é, p) is independent of and rh —1 on K. 


If {fn} C Hb and f,mf€ $4, we have, for each compact K CO, that 
Hf,og and 4fog are in L,(K) and that #f,o¢+— fog" in L,(K). 
Moreover “f,, © ¢-* is complex analytic in its complex parameters, whence “f is 
complex analytic on the canonical complex submanifolds of (9n, @). Apply- 
ing Lemma 1 and Proposition 2, we obtain 


THEOREM 2. §/? is complete and U"? is a unitary representation of G 


on 


4, An intertwining number theorem. Suppose now that with G we 
are given two collections {Tj, 1, Bi, Li, pi}, 1 = 1, 2, satisfying the conditions 
of the previous section. We wish to calculate an upper bound on the dimen- 
sion of the space Je of operators intertwining 
Ulvm and U4»'2, We shall be aided in this by the fact that [74+ is a sub- 
representation of Define the (non-unitary) representation corres- 
ponding to LZ; as in Section 3. For f€ Co*(G) and ve &;, set 


«°(f,v) = M,(€)vdé. 
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Let P; be the projection of H” ond H™*: and define «(f,v) = Pie? (f,v). 
If A€ R(U4,U"), fe Co*(G), and ve set (Aa(f, v)) (e). 
This definition makes sense: see [1], §5. If A€ R then 
AP, =P,AP,€ (U4,U™) so that we may set ra(f)u—rap,(f)v. If 
r,y€ G, we set (f2,f)(z) =f(a zy) for fe (G). 

If S is an operator mapping C,*(G) into itself, the operator ‘S, if it 
exists, will be defined by the identity (Sf) (x)g(r)dx= f f(x) (x) dx 
for all f,g€ Co” (G@). It is easy to check that if X€g, and z,y€ G, then 
=—X, *(XU) =— XU +4 (XA) (e), and If Z is any 
vector-valued distribution (in the sense of Schwartz [6]) on G, we set 
SZ=Zo't§. Finally, any locally integrable function F on G will be identified 
with the distribution f— f f(x) F(x) dz. 

Our intertwining number theorem now goes as follows: 


THEOREM 3. Let X, be an element of the universal associative enveloping 
algebra of g which, as a left invariant differential operator on G, 1s elliptic of 
order >4dim(G/T,). For (G), set f Xof llo +] f For 
each relatively compact open set O of G, give Cy*(O) the topology induced 
by || . give Co” (G@) the inductive limit topology. Let M be the sub- 
space of maps ZE (G); which satisfy the distribution 
equations : 


(1) (&)™* for &€ Ty; 
(2) XO'Z—Zp.(X)*, 

(3) XZ=—p2(X)Z, XE ne. 

Then < dim M. 


Proof. From [1], Theorem 3, we deduce that A >, is a faithful linear 
map of into the subspace of maps Z € £(C,”(G); 
satisfying (1). To show that r, satisfies (2), we note that 


(a(f0).9) f(x) (2,9(2))de 


for f€ Co” (G@), ve and ge This follows immediately from the 
definition of «, and the proof of Lemma 2(a) in [1]. Now (v,g(.)), regarded 
as a distribution on G, satisfies the equation YU*(v, g(.)) = (p1(X)*v, g(.)) 
for X € n,, in virtue of properties (1) and (2) of %4“". Therefore, 


(e.(*XO'F, v), 9) = f (x) (v, g(x) ) de 
= (er(X) g(x) = (e1(f, p(X) 9). 
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over r(dom(H)) and let B=aog™. Let w be the homeomorphism of © x O 
onto dom(H) given by w(é,p) —€B(p). Let yw be the measure on I x O 
induced under w» by right Haar measure on G, and let y be the density of » 
with respect to the cartesian product of right Haar measure on I by Lebesgue 
measure on O. We know that y(é,p) =8(é)7*A(é)y(e,p) for €€T and 
p€O[1]. Finally, for each compact K CO let 


R;, = Min[y(e, p) || 1(8(p))* pe K]. 


Let f € 3” and let K be a compact set in O. Choose h€ C*,(dom(#) ) 
such that and rh=10n K. Then 


= I dp 
because || H(éB(p))-* ||" y(é, p) is independent of é and rh =1 on K. 


If {fn} CH and f,—f€ $4, we have, for each compact K CO, that 
and 4fog are in L.(K) and that #f,o¢+— fog" in L,(K). 
Moreover “f,,o ¢~* is complex analytic in its complex parameters, whence “f is 
complex analytic on the canonical complex submanifolds of (9n,@). Apply- 
ing Lemma 1 and Proposition 2, we obtain 


THEOREM 2. §? is complete and U" is a unitary representation of G 


on 


4, An intertwining number theorem. Suppose now that with G we 
are given two collections {Tj, 8; mi, Bi, Li, pi}, t= 1, 2, satisfying the conditions 
of the previous section. We wish to calculate an upper bound on the dimen- 
sion of the space (U4, of operators intertwining 
Ulm and U'»':, We shall be aided in this by the fact that U4«+ is a sub- 
representation of U. Define the (non-unitary) representation M; corres- 
ponding to J; as in Section 3. For f€ Co*(@) and ve &;, set 


0) (2) = f (Ex) 


_ 
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Let P; be the projection of ond and define «(f,v) = Pie? (f, v). 
If A€ R(U4,U"™), fE (G4), and ve Bi, set r°a(f)v— (Aa°(f, v)) (e). 
This definition makes sense: see [1], §5. If A€ & (U4, U%2), then 
AP, =P,AP,€ R(U4,U™) so that we may set ra(f)v—rap,(f)v. If 
G, we set (f2,f)(z) =f(a*zy) for f€ Co? (G). 

If S is an operator mapping C,*(G) into itself, the operator ‘8, if it 
exists, will be defined by the identity (Sf) (x)g(x)dx= ff (x) (*Sg) (x) dx 
for all f,g€ Co*(@). It is easy to check that if X€g, and z,y€ G, then 
*(X0) =— XU+4 (XA) (e), and If Z is any 
vector-valued distribution (in the sense of Schwartz [6]) on G, we set 
SZ=Zot't§. Finally, any locally integrable function F on G will be identified 
with the distribution f—> f{f(x) F(z) dz. 

Our intertwining number theorem now goes as follows: 


THEOREM 3. Let X, be an element of the universal associative enveloping 
algebra of g which, as a left invariant differential operator on G, is elliptic of 
order >4dim(G/T.). For (G4), set | f Xof f lle. For 
each relatively compact open set O of G, give Cy*(O) the topology induced 
by ||. |lx,3 give Co®(G@) the inductive limit topology. Let Mt be the sub- 
space of maps ZE £(Co* (G); L(Bi,Bz)) which satisfy the distribution 
equations : 


(2) XO'Z—Zp,(X)*, 

(3) XZ—=—p.(X)Z, XE te. 

Then = dim 


Proof. From [1], Theorem 3, we deduce that A — 7, is a faithful linear 
map of into the subspace of maps Z € (G) ; (Bi; Be)) 
satisfying (1). ‘To show that 7, satisfies (2), we note that 


(a(fv)s9) = f(x) (2,9(2))de 


for f€ Co*(G@), ve Bi, and ge This follows immediately from the 
definition of «, and the proof of Lemma 2(a) in [1]. Now (v,g(.)), regarded 
as a distribution on G, satisfies the equation XU*(v, g(.)) = (p1(X)*v, g(.)) 
for X € n,, in virtue of properties (1) and (2) of 34%. Therefore, 


(a (‘XO'F, v), 9) = f (AUF) (x) (v, g(a) ) dx 
= ff (x) g(x) )da = (ex (f, p1(X)*), 9). 


| 
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From this we obtain 


(XOtr4) (f)v = (Aa (‘AU'F, v) ) (e) 
= (Aa (f,pi(X)*v)) (€) = (f) pi (X) *o 
for X € ny. 


As for property (3), we recall from the proof of Theorem 3 in [1] that 
ra )v = (Aa(f,v))(y) for all ye G. Now the mapping y>£.yf is a 
C? map of G into C,*(G@). Since ry is continuous and linear, we see that 
(Ae, (f,v))(.) is a C* function and that ra(Xf)v = [X (Ae, (f,v) ) ](e) 
== — [XU (Ae, (f,v))](e) for all X€g. If XE, we conclude that 


ra(Xf)v = p2(X)ra(f)v. 


5. Totally complex subalgebras. A complex subalgebra n of g, will 
be called totally complex if n+ n* = ge. 


Proposition 3. In Theorem 3, let either n, or ny be totally complex. 


Then M consists entirely of CY functions. 


Proof. We shall give a proof for the case when nz is totally complex. 
The other case is entirely analogous. Let {X,,---,Xn} be a basis for 
ho, and extend it to a basis {X1,---°,Xn,Xnu,°°°,Xr} of mt. Then 


is a basis of g,, by our assumption. If we 
write X¥;—=Y;+1Z;, where Y;,Z;€ g for j—=n-+1,:---,r, it follows that 
Xn, Zr} is a basis of g. Now the differential 


operator = 5", Xj*(Xj + =— XP — — + 
terms of lower order. This shows that sg is elliptic (cf. [7]) and cm, if 
ZEM, then SZ 0 according to property (3) of Mt. Therefore, Z is a C’ 
function (see p. 146 of [6] and the references given there). 

Let n be a totally complex subalgebra of g... We shall say that z is n- 
regular if n-+-ada-*n* =—g, and shall denote the set of n-regular elements 
by Rp. 

LEMMA 2. In Theorem 3 assume the following: G is connected; T, =T, 
=T, say; and ny = nz = 0, say, a totally complex subalgebra of g.. Let ZEM 
and suppose =0 for some a Ry. Then Z=0. 


Proof. We choose vector fields X,,- - -,X,€1 such that, fer suitably 
chosen r, {X;,- >,X,,ad ad *X,*} is a basis for g,. It is 
then easy to see that there is a C” cubical neighborhood N about 2 such that 
{X,,° - -,ada-*X,*} is a basis for g, for all z€ N. Let 
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1,’ * *>%n be the coordinate functions for N. There exist C® functions 4; 
for i,j==1,- - -,n such that, for all z€ N, 
0 r 4 n + 
J 
Now for any Y€ g., Therefore 
by virtue of Theorem 3. But Z(xz)) 0. From the uniqueness theorem for 
ordinary linear differential equations we conclude that Z vanishes on N. Since 
Z is a C” function, Z vanishes identically on G. 


We are now in a position to prove 


THrEorREM 4. Suppose that G is connected and suppose the induced 
representations and satisfy the following conditions: L, and Lz 
are defined on T; p; and p, are defined on n, a totally complex subalgebra of gc. 
For each x€ G, define a representation pi, of ada n* by setting 
= p,(ad 7X*)* for X € ada-n*, also a representation M,, of x*Tx by setting 
M,2(€) = M, for €€ Define I(x) to be the dimension of 


where 
ad zn’, pe|nN ada Na Pe, 


Then (1) I(x) depends only on the T: T double coset to which x belongs 
and (2) = GLB[I (x): Ry] <o. 


Proof. First note that e€ Ry so that GLB[J(r): «€ Ry] Co. Fix a 
point Ry. For each ZE M, set A(T) =Z (a). The map QE L(M; 
; 

(a) QE L(M;R.,). In fact, let ZEM. If EE TN x then 
LoCo AE T and Ao. Property (1) of Mt implies that Z 
= M,(€)Z (2) so that (xo) = Again, if 
XY€nNada,'n*, then ada X*+€ n. Properties (2) and (3) of M imply that 


Z (Xo) pix, (X ) = Z (Xo) pi (ad 
= ([ad aX (a0) = — (XZ) (Xo) = p2(X)Z (0). 


Therefore 0(Z) € 2,. 


(b) is one-to-one. In fact, suppose Q(Z) =0. Then Z(2) =0 so 
that Z=0 by Lemma 3. 


ut 
) 
| | 
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We have shown that © is an isomorphism of Mt into ®,,. This proves 
statement (2). The proof of statement (1) is standard and will be omitted. 


4 


CoroLLaRry 1. In case n is totally complex, a necessary condition for 
Hh" {0} is that 


Proof. In Theorem 4, let ¥,—¥.—%, Li =—L,—L, and pi=p2—p. 
If {0}, then M+ {0} so that {0}. Let OASE R,. Then, for 
all €€ T, = 8(é)4A(E) AL (E)S. It follows that 


8(€)A(E)* || Sv || =| SL(E)of SI S| oll 


for all v€ But this implies that 8(€)A(é)*] S||S|| for all €€T, 
and our conclusion follows. 


CoroLLaRy 2. Suppose n is totally complex. If L is irreducible and 
if HY? A {0}, then U" is irreducible. If L’ and p’ are two more such repre- 
sentations defined on T and n, then U"'"' and U™? are inequivalent if L’ and 
L are inequivalent. 


Proof. Apply Theorem 4 to (L,p) and (L’,p’). We get 1(U%, U"'"’) 
<I (e), which equals 1 or 0 according as L and L’ are equivalent or inequiva- 
lent. Our corollary follows immediately. 


7. Some examples. 
(a) Some representations of Harish-Chandra. 


Let G be a real semi-simple Lie group and let T be a Cartan subgroup. 
In this case, A= 1 and 81. We suppose that the image of I in the adjoint 
group of G is compact. Let g (resp. h) be the Lie algebra of G (resp. I). 
h- is a Cartan subalgebra of g,, and our assumption implies that every root 
of g- with respect to §, is purely imaginary on }. Let the roots be totally 
ordered in the usual way, and set n equal to h, plus the space spanned by the 
root vectors belonging to the positive roots. Let X be a root vector belonging 
to the positive root a Let Then [H,X+] 
= («(H)X)*=——a«(H)X* since «(H) is pure imaginary. We see that n° 
is h, plus the space spanned by the root vectors belonging to the negative 
roots. We have that n is a complex subalgebra of g., that n+ n*—g,, and 
that nO Since is connected, (adT)n—n. 

Let X be a character of the abeilan group Tf. dx extends uniquely to a 
complex linear representation p of n on C: merely note that p(X) must vanish 
whenever Y is a root vector belonging to a positive root. Let X be the 
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collection of all characters x of T such that $**~ {0}. Then the repre- 
sentations Ux? for x€ X are all irreducible and mutually inequivalent. The 
actual structure of X has been studied by Harish-Chandra in [4]. 


(b) Certain representations of GL(2k,R). 


Let % be a real finite dimensional vector space and let g be the Lie 
algebra of all T€ £(%;%W) with vanishing trace. g, may then be identified 
with the Lie algebra of all T'€ €(2%8,;%.) with vanishing trace. Suppose 
dim — 2k, k integral. It is then possible to choose vectors -, € We 


such that {v1,- Ux, is a basis for W,. Let n— {TE ge: v; is 
an eigenvector of for j=1,- - -,k}, a Lie subalgebra g,. Then =n 
equals {7'€ g,: v; and v;* is an eigenvector of T for 7—=1,-: - -,k}, a Cartan 


subalgebra of g,, and h =}, g is a Cartan subalgebra of g such that h, =}. 
We define linear functionals -,A, on n by for T € n. 
It is easily seen that the derived algebra n® of n—={T En: Aj(T) =0 for 
j=1,:--+,k}. Our first object is to determine the space P of all homo- 
morphisms of n into C such that p(T) is pure imaginary for p€ P, T€ b. 


LemMA 3. P= {a>%,A;: a€ R}. 


Proof. Clearly P is the set of all linear functionals p on n such that 

p(n®) and p(T*) p(T) for TE h,. We first show that P. 
Clearly =0. Let TE Then Ttv;* = — (Tv;)* =—A,(T) 
j 
Since 7’ has vanishing trace, we conclude that }*,A,(T*) = 3 *,A,;(T). 
j j 
Conversely, let p€ Po. Suppose 7'€ n satisfies —0. Define 
S€ £(W.; We) by Svj —Aj(T) vj, —0 for 7—=1,- - -,k. By assumption, 
S€g-. so that S€h,.. It is also clear that 7—S and Sté€ n®. Therefore 
p(T) =p(T—S) + p(St) =0. We conclude that p=a>*,A; for some 
j 
a€C. If Teh, then p(T) —p(Tt) =a4>4,A,(T). Since *,A; does not 
j j 

vanish on h,, we see that a€ R. 

Let G = SL (8) and let T be the analytic subgroup of G lying beneath 
h. Let Py be the collection of all p€ P such that p|h dx for some (neces- 
sarily unique) character xX of Tf. n=—-+ n® implies that x uniquely deter- 
mines p. A necessary and sufficient condition that p€ Py is that p€ P and 
ip be integral on the kernel of the map 7’ —expT of § onto T, and this is 
easily seen to be equivalent to the requirement that ip(7') be integral for all 
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T € § such that ImA;(7) is integral for j7=1,:--,k. But T—> (ImA,(T), 
- + +-,ImA;,(T)) maps onto R* because the trace restriction on T€ § is 


just }*, Rea;(7) =0. We conclude that p€ Py if and only if p=m>*, 
j j 
where m is integral. 


Since both G and T are unimodular and since I is connected, we obtain 
a series of representations U*" of G, where p€ Po and dx—p|h. If Poo is 
the set of p€ P, for which §**4 {0}, we conclude from Corollary 2 of 
Theorem 4: the representations UX for p€ Poo are all irreducible and mutually 
inequivalent. These representations are mentioned by Gelfand and Graev 
in Subsection 4 of the Introduction of [3]. 
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ON SOME INTERPOLATION PROBLEMS FOR ANALYTIC 
FUNCTIONS.* 


By H. S. Swaprro and A. L. SHIELDs.* 


In the first part of this paper we consider an interpolation problem for 
bounded analytic functions that was first solved by Lennart Carleson [3] 
(a detailed description is given below). We give a new proof of Carleson’s 
result and we obtain analogous results for the spaces Hy. In the second part 
of the paper (Sections Four and Five) we study a related interpolation 
problem in certain Hilbert spaces of analytic functions, using the results 
obtained previously. 

The interpolation problem for bounded functions has to do with the 
existence of sequences of points {z,} in the unit circle having the property 
that, given an arbitrary bounded sequence of complex numbers {w,}, there 
is a bounded analytic function f for which 


f (2n) = Wn (n = 1, 2,- 


If the sequence {z,} has this property we shall call it an interpolating sequence. 

The problem of the existence of interpolating sequences was posed by 
R. C. Buck. The classical interpolation theory (see, for example, Nevanlinna 
[10] and Schur [15]) does not seem to be of help here. For example, Schur 
gives an explicit algorithm for finding the bounded function of minimal norm 
taking prescribed values w,,: -,w, at the points 2,,- However, his 
method does not seem to lead to an effective estimate of this norm as 
increases and as {w,} ranges over the unit sphere of 1,. 

Independently of one another L. Carleson [3], W. Hayman [6] and 
D. J. Newman [11] studied this problem. Carleson showed that a necessary 
and sufficient condition for {z,} to be an interpolating sequence is that there 
exist a positive number 6 such that 


(C) II | (2 —2n)/(1 — | 28 (k= 1,2,- -). 
nk 
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Newman proved that condition (C) together with condition (H) given 
below are necessary and sufficient fo r{z,} to be an interpolating sequence : 


(H) >| an |) <0 (for all f€ H,). 


Thus Carleson’s result tells us that (C) implies (H), and in fact this is the 
heart of his proof. 

Hayman showed that (C) is necessary and that a slight strengthening 
of it is sufficient. With his stronger condition he is able to give an explicit 
interpolation formula to find an f taking the given values {w,}. He also 
shows that if one can interpolate arbitrary sequences of zeros and ones at 
the points {z,} then condition (C) is satisfied. 

Both Newman and Hayman show that if z, tend exponentially to the 
boundary, that is, if limsup(1—| %n+|)/(1—| 2n|) <1, then {z,} is an 
interpolating seqgence. They point out that if the {z,} all lie on one radius, 
then this condition is also necessary. 

Three years earlier, in connection with an interpolation problem for 
functions of bounded characteristic, A. G. Naftalevitch [9] showed that if 
{zn} satisfy the condition }(1—|z,|) <oo, then there exist points {z,’}, 


with | 2,’|=|2z,| (n=1,2,- - -), for which condition (C) is satisfied and 
for which 
(N) (1 — | zn |) log* | f (zn) | 


for all analytic f of bounded characteristic. He did not show that condition 
(N) is implied by condition (C). 

Subsequently, V. Kabaila [7] pointed out that Naftalevitch’s method 
shows that if }(1—| zn |) then there are points {2n’}, with | 2,’ | =1 2% | 
(n=1,2,- - +), for which condition (H) is satisfied. 

In Section Two of this paper we give a new proof of Carleson’s result 
that {z,} is an interpolating sequence if and only if condition (C) is satisfied. 
We first show by a Banach space argument that, in the presence of condition 
(C), {zn} is an interpolating sequence if and only if condition (H) is satisfied. 
This part of the argument is similar to Carleson’s proof, though differing in 
detail. We next show that (H) is equivalent to the following weighted 
interpolation problem in the space H,: 

For each sequence {c,}€1, there is an f€ H, for which 


f (Zn) (1— | 2n |?) 4 =n (n = 1, 2,- -), 


We complete the proof by showing that if condition (C) is satisfied then 


this weighted interpolation problem can be solved. 


INTERPOLATION PROBLEMS. 515 


In Section Three we discuss weighted interpolation in the spaces Hp. 
We associate with each f€ H, the sequence of weighted values 


Tof = (f(%) (1—| f(42) (1—| 22 
and we consider the following two problems (as always, /, denotes the space 
of sequences {c,} for which > | cp |? <0). 
1. Is 1, C T,H,», that is, can one obtain arbitrary p-th power summable 
values by weighted interpolation using functions of class Hp? 


2. Is C lp, that is, do we have >| f(zn)|?(1—|2n|) for 
all f€ Hy? 

One can show that if (1) is satisfied for some value of p (lS po), 
then condition (C) is satisfied. A Banach space argument shows that, in 
the presence of condition (C), (1) holds for a given value of p if and only 
if (2) holds for the conjugate index g (1/p+1/q=1). The problem can 
then be reduced to the weighted interpolation problem in H, which was solved 
in Section Two. The final result is that for any given value of p (1S po) 
T,H, = 1, if and only if the sequence {z,} satisfies condition (C). 

In Section Four we consider for general Hilbert spaces of analytic func- 
tions a weighted interpolation problem analogous to the H, problem in Section 
Two. Certain equivalences are pointed out and an application is made to 
the H, interpolation problem in case the points z, approach the boundary 
exponentially. 

In Section Five we use the results on H, interpolation and the results 
of Section Four to study interpolation in two special Hilbert spaces: the 
space H,’ of functions that are derivatives of H, functions, and the space 
A, of functions with finite area norm: f {| f(z)|?dady <0. We show that 
if the points {z,} satisfy condition (C), then the weighted interpolation prob- 
lem can be solved in each of these spaces. 

Furthermore, a necessary condition for weighted interpolation in these 
spaces (and in H,) is that the points {z,} be “weakly separated,” that is, 
|) for some constant c>0 and all i,j. If the 
all lie on one radius then this condition is equivalent to exponential approach 
to the boundary and is both necessary and sufficient. This is related to some 


results of Epstein, Greenstein, and Minker [4]. 


1. Background material. As usual H, (1S p<) denotes the Banach 
space of functions f, analytic for | z| <1, for which 


9 


| f if | f (rei) <a. 
r 
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t 
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H,, will denote the bounded analytic functions with the supremum norm. 
For the properties of the spaces H, see the book of Privaloff ([12], Chapters 
1, 2). In particular we shall need the fact that H, is isometric to a closed 
subspace of L,(0,27) (lS If fe Ly, L, (1/p+1/¢=—1), then 


we write 


With this notation H, is the annihilator of Hp: H, =H, (lSp<o). In 
the Hilbert space 1. (1) is not an inner product. 

Duality. Let H be a Banach space with conjugate space H*. Let PCH 
be a closed subspace and let F+ be the set of all those elements of * that 
vanish on F. Then F*, the conjugate space of F, can be identified with the 
quotient space H*/FL. The norm in the quotient space is given by 

+ FL | —inf e* FL). 

On the other hand, the norm of a linear functional is the supremum of its 
values on the unit sphere, and so: 
(2) inf || e* + A || ||f, e*)]. 

F+ feF 

It follows from the Hahn-Banach theorem that the infimum on the left is 
always attained. 

Similarly, by considering H/F and its conjugate space F we have: 
(3) inf | e-+f —sup |(¢,a)]. 

feF F+ 
The supremum on the right is always attained (since, in the weak star 
topology, the unit ball of F1 is compact and e is continuous). See Havinson 
[5] and Rogosinski, Shapiro [13] for a discussion of these duality relations 
and their applications to extremal problems in Hy. 

Quadratic forms. Let (a;;) (i,7—=1,2,---) be given. A result of 
Schur [14] states that if | ai |< M for all j, and if N for all i, 
then 

(4) | | = (MN)*> | a; 


2. Interpolation by bounded analytic functions. Let {z,} be a 
sequence of distinct points in the unit circle. Define a linear operator 1’ 
from H, to 1, (the bounded sequences) by 


Tf= (f (21), f(22),° +). 
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Clearly || 7 || 1. Note that if T is onto, then the interpolation can actually 
be done in a uniformly bounded manner, that is, there is a number M such 
that if w = {w;,} is any element in the unit ball of 1,, then there is an fe H. 
with Tf—w and | f || 

Indeed, let N be the set of those f € H,, vanishing on {z,}. Then N isa 
closed subspace and 7’ induces a one-to-one continuous mapping from the 
quotient Banach space H,/N onto 1,. By a theorem of Banach ([1], Chap. 
IV, Théoréme 5) the inverse mapping is bounded, which is our assertion. 


THEOREM 1. A necesssary and sufficient condition for {zn} to be an 
interpolating sequence (that is, =I.) 1s that 
(C:) | (2x —2))/(1— | (k=1,2,: 


or equivalently, 
(C,) there exist functions f;,€ He such that 

(i) fi = oir, 

(i) 

In this case each sequence w= {wn} in the unt sphere of l. can be 
interpolated by an f€ Ha» with 
(5) fl S (2/85) (1—2 logs). 

We shall use the following notations. 

By (2) (z—2;)/(1— zz), 
Bux (2) = Bn(z) (1 — / (2 — 2x) (kn), 

The equivalence of conditions (C,) and (C,.) is easy to see. Indeed, if 
(C,) is given, then the Blaschke product formed from the points {z,} with 
the point z; deleted, divided by its value at z,, may be taken as the function f;. 

Conversely, if (C,) is given then || || = || fx || S 1/8. Evaluation 
at z, gives 1/| by; |< 1/8 which implies (C,). 

Proof of Theorem 1. Necessity. The necessity of (C.) was proven in 
the remarks preceding the statement of the theorem. 


Sufficiency. Let w= be a bounded sequence (| w;,| <1 for all 
that we wish to interpolate at the points {z,}. The function 


hy, (2) — ( ) /Onk 
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solves the problem at the first » points. We rewrite this 
hy (2) = Bu(2) (1 — /(2 
Therefore the function 
By(2){ (w/b) —9(2))} 


will also solve the problem, for any g€ H,. We wish to choose g so as to 
minimize the norm in H,. Let 
(6) (w) ink || Bu(z){ (1 — /(2— te) — 9 (2) | 

IE 
We must show that m,(w) is uniformly bounded in n and w (w in the unit 
ball of J,,). Let us take (6) onto the boundary of the unit circle, and let ¢ 
denote the rational function given by the summation. Since | B,(e)|=1 


we have 


Mn(w) = inf || (g€ H,). 
9 


We now apply the duality reijation (2), with F=H,, E*=L,, 
FL—=H,. We obtain 


Mn (Ww) | (f€ || f 
Evaluating (f,¢) by (1) we have 
(7) my (w) — sup | f (2x) (1— | 
Since 6S | 1 for all we have 


sup | f(¢)|(1—| 


S sup sup m,(w) 8" sup | f (zx) |(1—| zx |?), 
n f 1 


w 


(8) 


where f € H,, || f and w beolngs to the unit ball of 1,. 

Now if f€ H, has no zeros in the unit circle, then fg? with g¢€ H,. 
If f has zeros, then f = Bf,, where B is a Blaschke product, f, has no zeros, 
If ll= and | f(ze)| <|f:(%)| for all &. Therefore the supremum on 
the right side of (8) is equal to 


(9) sup | 9 (2x) |? (1—| zx |*) (9 € Ho, || 9 lz=1). 


| 2 
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We now show that the finiteness of (9) is equivalent to a weighted inter- 
polation problem in H, 


LemMA 1. Let {z,} satisfy condition (C,) of Theorem 1. Then the 
following two statements are equivalent: 


(1— | 


(10) | 9 (zx) 


for some constant M and all g€ Ho; 


(11) there is a constant m such that if {w,} ts any square summable 
sequence, then there is a g€ Hz with 


lg we |? and g(ze)(1— | 2 |?)3= wrx (k=1,2,---). 


We shall show in fact that if (10) holds, then the constant m in (11) 
can be taken to be (1/8*)M, while if (11) holds, then the constant M in 
(10) can be taken to be m. 


Proof. The proof is similar to the proof given at the beginning of the 
sufficiency part of Theorem 1. Let w={w,;} be a sequence for which 


in a uniformly bounded manner in H, (i.e. (11)) is equivalent to (10). Let 


w; |? 1. We wish to show that the problem of interpolating all such w 


hy (2) > (2) | 2k |?)3. 


Then hy(2.)(1—| |?)2=we (kK =1,2,- 0). 
As before we modify h, and arrive at (6), except that now g€ H, and 
the norm is the H, norm. The duality relation now leads to 


— sup | 3 (2x) (1— | (Il f la—=1). 
The analogue of (8) with both sides squared is: 
[sup sup m,(w) ]? sup | f(z) |?(1 — | |?), 
n w 
where f€ H., || f|2—1, and w is in the unit ball of J... The number 


[sup sup m,(w) |* will serve as the number m in (11), and the result follows. 


To complete the proof of Theorem 1 it is now sufficient to establish (11), 
with N = 2(1/8') (1—2log8). This will be done in two lemmas, the first 


of which occurs in Carleson’s paper. 
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Lemma 2. If {z,} satisfies condition (C,), then 


(12) > (1— | 2; |?) (1 —| |?) /| 1 — |? 1— 2 logs 
=1 
Proof. The identity 
| (2% — 25) / (1 — |? = 1 (1 — 2; |?) (1 —| |?)/| 1 — |? 


together with the inequality 1— b Sexp(—D) and a little calculation shows 
that the series in (12), summed over j/k, is S[—2 log4, and the result 


follows from this. 


Lemma 3. If {z,} satisfies condition (C,) and tf {w,} is any square 
summable sequence, then there is a g€ He for which 


lg |? S (28+) (1 —2 log 8)> | wx. |? 
and 
(2x) (1 — | |?) = wx (k=1,2,- °°). 


Proof. For this proof (f,g) will denote the inner product in H,: 


(in contrast to (1)). Let ; 


gnu (%) = (Bn(z)/(%— 2x) )?(1 — | 2x 
and 


fn Wr9nk (2) /Onk?. 
Then fn(2z)(1—| 2x |?)#= we (k=1,2,---,n). Moreover, 


(14) | fa? (fu fa) = (9 


Here g; stands for g,,. A calculation shows that 
(1— | 2; |?)8(1 + 


(9, 9) = (1— | 


Let aij—= (gi,9;). An easy calculation shows that (1— | z |?)#(1—| z;|?)a 
= |1—%z;| for all i,j. Then by Lemma 2 we have: 


| ay | (1—| |?) (1 —| |?)/| 1 — Hey |? S 2(1 —2 logs). 


Since a;;—= 4, the same bound holds for the sum on 7. Therefore by (4) 
we have | f (2/8*) (1—2logd)> | |*, and the result follows. 
This completes the proof of Theorem 1. 
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Remark 1. Condition (C.) is conformally invariant and so Theorem 1 
may be carried over to any simply connected domain. Thus in the strip 
ly|<o the points z2,—km satisfy condition (C,) with the functions 
f,(z) = sin(z—kr)/(z—kr). Here 1/8 sup sup | f(z) | = /2o. 


Remark 2. The functions f, that were used in the proof of Lemma 3 
are not the functions of minimal H, norm that solve the interpolation problem. 
It is a curious fact that if one uses the functions that must be minimal (their 
form is known, for example, from [13] Theorem 16), then one is led to a 
quadratic form that is much more difficult to handle than the one in (14). 
This form will be considered in the corollary to Theorem 3. 


3. Interpolation by functions of class H,. Let the sequence {2,} be 
given and let (s) denote the set of all sequences of complex numbers. For 
each p (1S poco) define a mapping Hp— (s) by: 


T of = (f (21) (1— | f(z2) (1— | 22 

THEOREM 2. For each p (1S poo) the necessary and sufficient con- 
dition on {2n} in order that T,Hp=l, is that {zn} satisfy condition (C). 
or equivalently, 

(Cop) there exist functions f,€ Hy such that 

(i) (1 — | = 8x 

(ii) fr Jp S170. 


The proof is very similar to that of Theorem 1 and so we shall merely 
sketch it. First, the equivalence of (C,) and (C2) is proven as before, 
using also the fact that | f(z)| = || f lp/(1 —| |?)*/” for all fe H, (see [8], 
p. 304). 

One uses the closed graph theorem to show that if lp C TH», then the 
interpolation can actually be done with uniformly bounded H, norms. (If 
N is the subspace of functions vanishing on {z,}, then a mapping is defined 
from J, into H,/N. This map has a closed graph.) This implies the necessity 
of (C.,p). 

It is easy to see that if T,H, C ly, then T, has a closed graph and is 
therefore a bounded operator (this was obvious for poo). 


Lemma 4. Let condition (C,) be satisfied. Then TypH» C lp for a given 
value of p if and only if lq C TqH, for the conjugate index q (1/p4+-1/q=1). 


The proof is similar to the proof of Lemma 1. 


= 
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or 


To complete the proof of Theorem 2 one proceds as before (going from 
(8) to (9)) to show that T,H, C I, if and only if T.H. C l., which reduces 
the problem to Lemma 3. QED 


Remark 3. In H, an interpolating function can be given explicitly. 
Let {w;} be the values to be interpolated, with > | w; |(1—| 2 |?) <oo. Then 


f(z) =>(B(z)/B’(%)) [1/(@— 2x) —1/ (2 — 


will solve the problem. Here B(z) denotes the Blaschke product formed from 
the z,, and z,*=1/zZ,. A simple calculation shows that the expression in 
square brackets has norm 1 in H, (since |z—z|=|1—%z| on |z|=—1). 
Also, || B ||] = 1, and so under condition (C,) we have an absolutely convergent 


series of unit vectors in H,. 


4, Interpolation in Hilbert spaces of analytic functions. The results 
in Section 2 on interpolation in the space H, suggest similar problems for other 
Hilbert spaces of analytic functions. By a Hilbert space H of analytic func- 
tions we mean a Hilbert space whose elements are analytic functions in a 
domain D, and in which evaluation at a point is a continuous linear functional 
on H. This means that to each point £¢€ D there corresponds a unique function 
K,(z) € H (the kernel function) such that 


(15) = (f, Ke) 
for all f€ H. Thus for | f || =1 we have 
(16) S| Kell = (Ke, Ky)? = 


Equality is attained for f—K,/|| K,||. It follows easily from (15) that 
K,(b) =K,(a) for alla,b€ D. Indeed, K,(b) = (Ka, Ky), Kx(a) = (Ko, Ka), 
and these are conjugates of one another. For a general discussion of kernel 
functions and their properties see Aronszajn (Proceedings of the Cambridge 
Philosophical Society, vol. 39 (1943) and Transactions of the American Mathe- 
matical Society, vol. 68 (1950) ). 

We note the following examples. 


I. H,. Here K,(z) =1/(1—%). 


II. A, This is the space of functions f(z) = Sa,2", analytic for 
|z| <1, with norm 


Here K,(z) =1/(1—Z)?. 


— 
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Ill. H,.’. This is the space of functions f(z) = S,a,z" that are deriva- 
tives of functions of class H., with norm 


*) dady = | dn |?/(n +1) (n+ 2) 


ff 


Here K,(z) = 2/(1—%z)*. 


IV. D (finite Dirichlet integral). These are the functions f(z) = > anz" 
analytic for |z| <1, with f(0) 0, whose derivative is in A,. The norm 


is given by 


and K,(z) =log{1/(1—@z)}. 


V. Functions of exponential type in the upper half plane with 


2dz 


2 f 


y 


Here K,(z) =i/(z+). By a Paley-Wiener theorem these functions are 
Fourier transforms of functions in L.(0,0). 


VI. Entire functions of exponential type = £8 that belong to Lz on the 
real axis. Here the norm is given by 


and K,(z) = {2sin B(z—Z)}/(z—2@). By another Paley-Wiener theorem 
these functions are Fourier transforms of functions in L,(—8, B). 

In this paper we consider only the first three of these spaces. 

In H, we considered the problem of weighted interpolation: when will 
the sequence {z,} be such that {f(2n)(1—|2n|*)4} (f€ He) is the most 
general sequence in /,? In terms of the kernel function K,(z) this becomes 
the following problem: for which sequences {z,} does {f(2n)/(Kn(2n) )3} 
(f€ H) represent the most general element in /,? (We have written K, for 
K,,.) In this form the problem can be posed for any Hilbert space of analytic 
functions. There are really two questions involved here: first, is the sequence 
always in /, for evey f € H, and second, is every J, sequence obtained in this 
manner? Both of these questions can be formulated in an abstract Hilbert 
space as follows (see Bari [2]). 
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Let {zn} be a sequence of elements in a Hilbert space H. We shall 
say that {z,} is a Bessel sequence with bound M if 


(17) | (2%, tn) |? SM || ||? 


for all z€ H. (lf one merely assumes the convergence of the series on the 
left side of (17) for all x€ H, then one has a linear mapping from H to /,. 
This mapping is easily seen to have a closed graph, which implies the existence 
of the constant J.) 

We shall say that {v,} is a Riesz-Fischer sequence with bound m if to 
each sequence {c,}€ 1, there corresponds at least one x€ H for which 


(18) (2, (n= 1,2,: +) and |z/?=m> | |?. 


Here also if one merely assumes the existence of x€ H such that (2, tq) =c¢, 
(n =1,2,- - -) for each square summable sequence {c,}, then the existence 
of the constant m follows from the closed graph theorem. Indeed, if J is the 
subspace of H orthogonal to all the z,, then one has a mapping from /, into 
H/J, and this mapping has a closed graph. 

If H is a Hilbert space of analytic functions in a domain D, and if {z,} 
is a sequence of points in D, then we take for the sequence z, the normalized 
kernel functions: K,(z)/|| Kn ||, where K,(z) = K,,(z). The Bessel property 
then becomes the inequality 


| f(4n) |?/Kn(%n) SM || f Il? H), 


while the Riesz-Fischer property is a statement about weighted interpolation : 
to each square-summable sequence {c,} there corresponds an f€ H such that 


f (2n)/(Kn(4n) = Cn (n=1,2,-- 


Nina Bari [2] shows that in an abstract Hilbert space these properties 
are equivalent to a number of other properties (her terminology is somewhat 
different from ours). We shall need only one such equivalence. 


THEOREM 3. Let {an} be an infinite sequence of elements in a separable 
Hilbert space, and let A denote the inner product matrix ((ai,2;)) (i,j 
=1,2,---). Then 


I. {z,} ts a Bessel sequence with bound M if and only if the matrix 
A 1s a bounded operator on 1, with bound M. 


II. {xn} is a Riesz-Fischer sequence with bound m if and only if the 


matrix A is bounded below on 1. with bound m. 


_ 
—— 
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The matrix A is said to be bounded below by m if || a || =m || Aya || for 
all n-tuples and all n. Here the norms are norms, 
and A, is the n X nm matrix (a,2;) (1,7—=1,2,:--+,n). This is equivalent 
to saying that || a ||? = m(A,a, a) for all n and a, and it is also equivalent to 
|| An? || Sm for all n. 


Proof. I. Let {x,} be a Bessel sequence, let {cy} € 12, let p be fixed, 
and let y= ++ Cp%p. On the one hand, 


ly |? | en(y, |? Cn |? (y, tn) |? 
=M |y|?=| 


Dividing by || y ||? we have: | y |? >| c,|?. On the other hand, 
(19) ly I? = = 


Thus the quadratic form in (19) is bounded by M, which implies that the 
matrix A is bounded by M. 


Conversely, let A be bounded by M and let {e,} be a complete orthonormal 
set in H. We first show that there is a bounded operator T: H—H, such 
that Te, = 2p. 

Indeed, let x; =D dnién, and let T be the infinite matrix (b,4). Then 
T*T =A, and so || T ||? = || T* ||? = || A | (T* is the conjugate transpose 
of 7). Also, Te, 

We now establish the Bessel property. Let r€ H be given. Then 
= (2, Ten) = (T*2, en), and so 


| tn) |? = |] |? SM | |I?. 


II. Let {z,} have the Riesz-Fischer property, let a1,: - -,@n be given, 
and let y€ H be the element of minimal norm such that (y,2;) =a; 
(t=1,---,n). Then y lies in the subspace spanned by 2,:--,2, and 
therefore y= + ++ Let y, € H be such that (y,,2;) =a; (Sn), 
(y1,%:) =0 (t>n), and | y, |? m>|a;|?. Then 


Also, 
= (%,y) = 6; 25), 


or more concisely, @d==A,¢. Therefore 


m || a ||? = —2 C0; = (Ané, = (4, An*@), 


and so || || =m. 
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Conversely, assume that || A,~* || S m, and let {a,} € 1. be given. To show 
that {z,} is Riesz-Fischer it will be sufficient to produce a sequence y» such that 


(Yn, Zi) = é and || Yn 


for by weak compactness a subsequence of y, will converge to an element x 
satisfying (18). 

Let Yn = We wish to choose {c;} such that a; = (yn, 
(i—1,---,n). Just as before this means that d—=A,¢, or Also, 


Yn |]? = (Yns Yn) = (i, = (Ané, 
= (4,A,74) Sm|a|?—m| 
This completes the proof. 


We give an application of this theorem to the H, interpolation problem 
considered in Lemmas 1 and 3, in the case of exponential approach to the 
boundary. In this case we obtain a better constant than is given by Lemmas 
1 and 3. 


Corotiary. If the points {2z,} satisfy the condition 
1—| ener | Sa(1—| 2n |) 


for all n and some « (0<a<1), then 
Fen) [21 — | S2((1 + a8) /(1 — a8) 
for all f€ Hz. 
In Newman [11] (Theorem 2) it is shown that 


((1+ 2")/(1—2")), 
n=1 
where 8 is the constant of condition (C,) in Theorem 1. It can be shown 
that equality is attained here if z, —1—a" (n—1,2,---). 


Proof. Since the kernel function for the space H, (the Szego kernel) 
is 1/(1—%z), the corollary will follow from Theorem 3 if we can show that 
the matrix A = {@nm}, where 


(20) lnm = (1— | 2n |?)9(1 — | 2m |?)8/(1 — 


bounded above by 2(1-+ a)/(1— 4). To do this we apply (4). 
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It follows from the hypothesis that 1— | |S a?(1—|2n|). Also, 
1 —ZnZm | = 1—| 2min(n,m) |. Therefore 


m 
| | => | dam | + E | | 
n=1 1 


m+1 


<2 3 (| am |)/A—| +23 (A —| 


IIA 


2 og(m-n)/2 + g(n-m)/2 < 2(1/(1 ob /1 a). 
1 m+1 

5. Two special Hilbert spaces. In this section we consider the spaces 

A, and H,’ (see Examples II and III of Section 4). We shall need the 
following result of Schur [14]. 


THEOREM (Schur). Let A= (aj) and B= (by) be two Hermitian, 
positive definite n X n matrices of complex numbers. Let a, a» be the smallest 
and largest eigen-values of A, and let B, =inf bi, Bo =sup by (1 -, 0). 
Then C = (aijbi;) 1s positive definite and all eigen-values of C le between 
and 


Since the smallest and largest eigen-values are respectively the infimum 
and supremum of || Cz || for || z || =1, it follows that 


| C | A] and | C* | S A* I. 


Let {zn} be a sequence of points in the unit circle and let H be a Hilbert 
space of analytic functions in the unit circle with kernel function K,(z). 
We consider the unit vectors K,(z)/|| Kn ||. Using the reproducing property 
(15) of the kernel function the inner product matrix of Theorem 3 becomes 
(K;(%)/|| || || K; ||). In particular, in H., A., and H,’ the general element 
of this matrix is: 


(21 ) (8;8;)3/(1 2423), 
(22) As: (1 — 242;)", 
(23) (8:8;)2/ (1 — %2;)*. 
Here 8; = 1— | 2; |?. 


Finally, we introduce the operator 7 that associates to each f€ H the 
weighted sequence of values 


(24) Tf = (f(41)/|| Ka Kel, *). 


Turorem 4. Let {zn} be a sequence of points in the unit circle 


| 


028 H. S. SHAPIRO AND A. L. SHIELDS. 


satisfying condition (C1) of Theorem 1, and let 8,=1—|2|*%. Then 
TA,=—TH,’=l,. More precisely, 


| |? dn? S (1—2logs)|| fl? (all fe Az) 


(25) 
| f(2n) |? S 2(1—2logs)||f (all fe H2’), 


and, to each square summable sequence {Cn} there corresponds an f € Az and 
ag€H,’ such that 
f (2) 8n = 9 = Cn, 
Ig en? 
Here the norms are taken respectively in A, and in H,’, and the number m 


ins the smallest admissible bound for the H, interpolation problem (by 
Lemma 3, mS (2/8*) (1— 2 log8)). 


Proof. It follows from (4) and Lemma 2 that the matrices (22) and 
(23) are each bounded by (1—2logé). Equations (25) now follow from 
Theorem 3. 

Equations (26) follow from Schur’s theorem and Theorem 3, since the 
elements in (22) and (23) are respectively the squares and the cubes of the 
elements in (21), and the diagonal elements are all equal to one. This 


completes the proof. 
We shall need an analogue of Schwarz’s lemma for the spaces A, and H,’. 


Lemma 5. Let a, b be two points in the unit circle and let 
d= (1—|a|?) (1—]| |?)/| 1 — ab |*. 


Then 


| S — | |/| 1 — ab |) + 
for all f€ A. for which f(a) =0. Also 
| S (2 — | |?)2)(| a— |/| 1 — ab |) 4+ d+ 
for all f€ H.’ for which f(a) =0. Equality ts possible in both cases. 


Proof. Let H denote either of the spaces A, or H,’, let K;(z) be its 
kernel function and let J be the subspace of all functions vanishing at a. 
Then J is itself a Hilbert space of analytic functions with the kernel function 


(27) $:(z) = K,(z) —cKa(z), 


where c—K,(a)/Ka(a). Indeed, so for each fixed 
Also, if f€ J, then 
(f,¢:) =f(¢) — f(a) =f 


= 
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and so d¢ reproduces the values of all f€ J. But the kernel function is always 
unique, and so ¢, is it. 


Therefore for f€ J we have: 


f(b) |? =| (f, |? SF I? (do, = I? 


Substituting the specific kernel functions for A, and H,.’ we obtain the 
lemma. 

We shall say that a sequence of points {z,} in the unit circle is weakly 
separated if there is a constant c > 0 such that 


(28) | 2; — 2; | = max 


for all 1, 7 (that is, the distance between points is greater than a constant 
multiple of the distance to the boundary). It is easy to see that if the z, 
all lie on one radius, then this is equivalent to exponential approach to the 
boundary: lim sup 8)41/8n < 1. 

Weak separation is equivalent to the following condition: 
(29) | 2; | 2 | 1— | 
for some constant c, > 0 and all i, 7. Indeed, writing a, b for 2;, z;, we have 
1—|a|?=2(1—|a|) S2|1—4b|, and so (29) implies (28). 


Conversely, a little calculation shows that if |a|=|b| <1 and if 
r=|a—b|/(1—|a|?), then 


|a—b |/|1—ab | 
Therefore, (28) implies (29). 


We now show that weak separation is a necessary condition for weighted 
interpolation. 

TuHEoreM 5. Let H be either of the spaces A,, H,.’, and let T be the 
operator in equation (24). If l,C TH, then the points {zn} are weakly 
separated. Furthermore, if the {zn} are weakly separated, then TH C l,. 
More precisely, if (28) holds and if b=min(c,1), then 
| [28:2 (2/0)? IP (all f€ As), 

D> | f(a) |? S 2(2/b)? |] (all f€ H.’). 


Proof. If 1. C TH, then there is a constant m such that each element 


(30) 


in the unit ball of 7, can be obtained by weighted interpolation from an f € H 


= || f —| Ka(b) |*/Ka(a)). 
9 
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with || f || =m (see the beginning of the proof of Theorem 2). Therefore, 
if i, j are given (1-47), then there is an f€ H, ||f || =m, such that 


f(%)/(Ki(u))?=1, =0. 


(We could require f(z) =0 (m1), but we shall only use the values of / 


at 2;, 2;.) Applying Lemma 5 with a= b =z; we have in 
1/(1 — | Zi = (m/(1 — | 1 — 2,2; 


Therefore 
(31) | zi —z,;|/| 1— zz; | = const > 0, 


and this is equivalent to weak separation. Starting with the space H,.’ we 
obtain (31) (with a different constant) in exactly the same way. 

We now turn to the second part of the theorem. The points {z,} are 
given to be weakly separated and we must establish (30). 

Let b=min(c,1). It follows from (28) that the circles Cj: | z—z; | 
<= 4d8; are disjoint and are contained inside the unit circle. Also, | f(z)!* 
is a subharmonic function and so 


* 


for all f analytic in the unit circle. If f€ A, we sum (32) on 7 and obtain 
the first equation of (30). 

For f€ H.’ we first multiply both sides of (32) by 8; and note the 
inequality: 1—|z;|*=2(1—|z|*) for z€C;. Indeed, the right side is 
smallest for | z| =| 2; | + $d8;< | z; | + 48; and for this value the inequality 
may be verified. Now summing on i we obtain the second equation of (30). 
This completes the proof. 


CoroLuary. If the points {z,} all lie on one radius, then a necessary 
and sufficient condition for weighted interpolation (1, C TH) in either of 
the spaces A., H.’ is that the z, approach the boundary exponentially. In 
this case, one has 1, —=TH. 


This follows at once from Theorems 4 and 5. 


Remark 4. It can be shown that if f€ H, and ¢(z) =1/(1—z)3 
then f; = of € Ao, and H.’. Hence if then f,(2n)8, 
= fo(2n)8n2=c, (n=1,2,:--). Thus if one can solve the H, problem 
explicitly for the case when the points 2, all lie on one radius, then one also 
has an explicit solution of the problem in the spaces A, and 1,’. 


p 
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Finally. we would like to discuss a result of Epstein, Greenstein, and 
Minker [4]. They consider the space H of functions analytic in the strip 
D: |y|<o for which the area norm is finite 


This is a Hilbert space with the kernel function 
K = (2/1607) {sech (4 (z—Z) /40) }*. 


They consider interpolation at the points {kr} (—»o<k<o) and they 
show by Fourier analysis that the necessary and sufficient condition on a 
sequence {c,} that it be the set of values of an f€ H at these points is that 
|?<o. Since K,(#) =const. for real, their problem can be con- 
sidered as a problem in weighted interpolation. 

To put the problem into our framework we consider the mapping 


z= ¢(w) = tanh(rw/4o) 


(wé€ D,|z| <1) from the domain D to the unit circle. The points w, = nz 
vo over into the points z, = tanh(nz?/40). These points lie on the real axis 
(—1<2z <1). and 

1 — | 2, |? = {sech (m?/4c) }°. 


Thus z,— 1 exponentially as n—>oo, and z, > — 1 exponentially as n > —oo. 
An application of Theorem 3 (just as in the corollary to Theorem 3) 
shows that 


for all f€ H.. From Lemma 1, Remark 1, and Theorem 4 we obtain similar 
results for the values of functions in A, at these points. Finally, to go back 
to the space H in the strip it is necessary to “transplant” the functions of 
1, and multiply by the derivative of the mapping function. Thus, if f€ A», 
then 


g(w) =f (p(w) 


is in H, and g(w,) = (2/40) f (Zn) (1 — | 2n |*). 

The theorem of [4] follows from this, and one could obtain a bound 
for the norm of the minimal interpolating function. The method just 
presented will establish a similar theorem for certain sets of points {w,} that 
are not in arithmetic progression. Using the corollary to Theorem 5 one can 


obtain the following result. 


/ 
oO 
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A. 


THEOREM 6. A sequence of real numbers {wn} has the property that 
TH =1, if and only if there is a number d such that for all n, m one has 


| Wn — Wm | = d ~~ 0. 
Here T is the operator: Tf = (f(w,),f(w.),: + +), and H is the Hilbert space 
of (33). 
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REPRESENTATION OF A SEMIGROUP BY TRANSFORMATIONS 
ACTING TRANSITIVELY ON A SET.* 


By E. J. JR.’ 


A representation of a semigroup S will mean here a homomorphism of S 
into the semigroup of all transformations of some set ‘/ into itself. This 
concept may be identified in an obvious way with that of a se¢ with semigroup 
of operators, which will also be called an operand over a semigroup, symbolized 
by My. By this we mean a set J/ together with a semigroup S and a mapping 
(2,a) > va of MX S into M, such that (va)b = x(ab) for all re M, a, DES. 

The theory of representations (by transformations) of groups has been 
extensively studied; accounts of this theory are given in [2] and [4]. The 
corresponding theory for semigroups has been studied relatively little. Stoll 
[6] has studied the representations of finite simple semigroups. He has also 
introduced the concept of transitive system (which we shall call strictly cyclic 
operand or strictly cyclic representation). By this is meant an operand My 
such that = for some x€ M, where eS = {xza: a€ 8}. Thurston [8] 
has studied the congruence relations on an operand Mg (i.e., equivalence 
relations on such that zoy implies zacya for all 8), calling 
them S-equivalences, and calling M an S-set. 

The present paper determines those representations of a semigroup S 
which are strictly cyclic (thus generalizing [6]), and more particularly those 
which are transitive (i.e., in which 7S = M for all «€ 1; not the same thing 
as transitive system in the sense of Stoll) or 0-transitive (i.e., such that 

and for all except one x for which =z). In 
particular we shall obtain conditions for S to have faithful (i.e.. 1-1) repre- 
sentations of the types considered. 

Section 1 deals with strictly cyclic representations of a semigroup S. 
We show that all such representations arise (essentially) by letting S act by 
right multiplication on the set of o-classes where o is a right congruence on S 
such that eava (some e€ S, all a€ 8S) (1.1). The representation is faithful 
if and only if o contains (i.e., is less fine than) no (2-sided) congruence 
except the identity relation (1.2). A necessary and sufficient condition for 

* Received October 28, 1960. 

‘The writing of this paper was supported in part by the National Science Founda- 
tion. The author wishes to express gratitude to Professor A. H. Clifford for many 
valuable criticisms and suggestions. 
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the existence of a faithful strictly cyclic representation (1.3) is an immediate 
corollary; this condition is very weak since it is satisfied by any semigrou) 
with identity (1.4). 

Section 2 shows that all transitive representations arise by letting S act 
on the o-classes where o is a right congruence, eava (some e€ 8S, all a@€ 8) 
and each o-class meets each right ideal (2.1). A n.a.s.c. for the existence 
of a faithful transitive representation follows immediately (2.2). Similar 
but more complicated conditions are obtained for 0-transitive representations. 
As applications of these results, we show among other things that a commu- 
tative semigroup S has a faithful transitive representation if and only if S 
is a group (2.4) and (in Section 4) that the free semigroup of rank 7 has 
such a representation if and only if n= 2. 

Section 3 studies [0-| transitive representations of semigroups with 
certain finiteness conditions (viz., the existence of a minimal right ideal for 
the case of transitive representations, and minimum condition on principal 
right ideals for the case of 0-transitive representations). The main result 
gives a n.a.s.c. not involving right congruence relations for the existence of a 
faithful [0-] transitive representation (3.4 and 3.4’). 

Section 4 gives some examples. 

Notation. Transformations will be written on the right. If A and / 
are sets, A—B denotes {x: A,.¢ B}. denotes the cardinal of 
A CB means “A is a subset of B.”. A C B means “A is a proper subset of B.” 
&, R and # are used in the sense of Green [3] to denote certain equiva- 


lence relations on a semigroup. 


1. Strictly cyclic representations. We say that two operands My ani 
Ng over the same semigroup are isomorphic (and we write Ws = Ny) if there 
is a 1-1 correspondence z between M and N with (ra)r=—(rr)a for all 
M,aéeS. We call an element u of Ma strict generator if uS = M. Thus 
Vs is strictly cyclic if and only if J/ has a strict generator. 

If o is a congruence relation on J/s, we can let S act on the set W/o 
of o-classes by defining Yoa (X € M/s,a€ 8) to be the (unique) o-class 
containing Xa=—{xa: r€ VY}. This gives a new operand which we denote 
by 

If S is a semigroup, we denote by Ss the operand in which S acts on 
itself by right multiplication. It is clear that the congruences on the operand 
Sg are precisely the right congruences on the semigroup S. Thus if o is a 


right congruence on the semigroup, we can form the operand Sx/o. 


We call e€ S a left identity modulo a right congruence o if easa for 


all a€ 8S. 
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The following is the generalization to semigroups of the basic theorem 
on transitive representations of groups. 

Proposition 1.1. Let u be a strict generator of Mg. Define a relation 
ov on S by: acb if and only if ua=ub. Then o is a right congruence, there 
is a left identity modulo o, and Mg = Sg/c. 

Proof. Suppose aob. Then ua=ub. Therefore wac = ubc for all c€ 8. 
This means that acobc, so that o is a right congruence. Let e be any solution 
of we—u (which exists because u is a strict generator). Then for a€ VX, 
ued = ua so that eaca. This shows that e is a left identity modulo «. Now 
define a 1-1 mapping z of M onto S/o by: aa7= {a: ua=-x}. Then it is easy 
to see that = so that is an isomorphism. 

Note. Conversely, it is easy to see that if o is a right congruence on S 
and there is a left identity modulo o, then S/o is strictly cyclic (each o-class 
which contains a left identity modulo o being a strict generator of S/o). 

We define the kernel-congruence x of an operand Myx by: axb if and only 
if ca= 2b for all x€ M. « is a congruence on S. x is the identity relation 
if and only if J/g is faithful. 

Proposition 1.2. Leto be a right congruence on a semigroup S, e u 
left identity modulo o, « the kernel-congruence determined by the operand 
Ss/o. Then axb if and only if cascb for all c€ 8S. Moreover x is the largest 
left congruence (and therefore the largest 2-sided congruence) contained in o. 
In particular, Ss/o is faithful if and only tf there is no left (equivalently. 
no 2-sided) congruence contained in o except the identity relation. This is 
equivalent to the condition that a=b whenever for all c€ 8 cascb. 

Proof. If axb, then Soa=Zob (co denoting the operation in Ss/c) 
where ¥ is any o-class. This means that caccb for all c€ S, and the reasoning 
is reversible. Next we show that « is contained in o. Assume axb. Then 
easeb, Whence aob. Now suppose 7 is any left congruence contained in o. 
Then arb implies carcb for all c€ 8S, so that caoch. Hence axb. This shows 
that + is contained in x. The rest of the proposition is then clear. 

CoroLtLary 1.3. A necessary and sufficient condition for a semigroup S 
to have a strictly cyclic faithful representation is that there exist a right 
congruence o on S, such that there is a left identity modulo o, and such that 
xs contains no left congruence except the identity relation. 

CoroLtuary 1.4. A sufficient condition for S to have a faithful strictly 
cyclic representation is that S have a left identity (and in this case Sg is 


strictly cyclic). If S is commutative then this condition is also necessary. 
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2. Transitive representations of arbitrary semigroups. We call . C 1/ 
invariant if XS CX, where = X,a€ 8}. Thus, for My to be 
transitive (which means that each element of M is a strict generator) is 
equivalent to the assertion that the only invariant subsets of WV are M and (), 
For Mg to be 0-transitive means that ./ contains an invariant element z and 
every «€ M with xz is a strict generator. Let us call Ms the null operand 
MS|=1. Then it is easy to see that the following 


of cardinal 2 if | M | =2, 

conditions are equivalent: 

(1) Wg is either 0-transitive or the null operand of cardinal 2 

(2) J has an invariant element z, and has no invariant subsets except /, 
{z} and @. 

We call Ws primitive if the only congruences on WV are the identity 
relation and the universal relation. It is not difficult to show that if | M | =3 
and WV is primitive, then J/ is either transitive or 0-transitive. 

Examples. Wet M be any set. If 8 consists of all transformations on JV, 
Mg is primitive and transitive. If | /|=3 and S’= {ae S:| Ma|=—1}, 
then My is transitive but not primitive. If |M|22, z€M and 
T = {a€ 8: za=z}, Mr is primitive and 0-transitive. If | 23, 
and 7’ = {a€ T:|(M— {z})S8|=1, then My is 0-transitive but not primi- 
tive. Let Ms be any operand. Let x¢ M, XY =a8U {x}. Define o on Vy by: 
woy if and only if wy or 28}. Then is transi- 
tive or 0-transitive, according as Z = or ZAW. 

Proposition 2.1. Leto be a right congruence on 8S. The following are 
equivalent : 

(3) is transitive 
(4) For all a,be€ S there exists c€ S with acob 
(5) Each o-class meets each right ideal of 8S. 
PROPOSITION 2.1’. Let o be as in 2.1. The following are equivalent: 


(6) Sys/o is 0-transitive or the null operand of cardinal 2 
(7) One o-class RAS is a right ideal, and for all ac S—R, bE S there 
exists c€ S such that acob 
(8) One o-class RAS is a right ideal, and every o-class meets every right 
ideal which is not contained in R. 
Proof. First note that for a g-class A to be a strict generator of S/o 
means that for all a€ A, b€ 8 there exists c€ S with acob, or equivalently 


that every right ideal which meets A meets every o-class. Hence (4) and 


(5) each assert that every o-class is a strict generator, and this is equivalent 


a 
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to (3). For 2.1’, note that a o-class A is an invariant element of S/o if 
and only if A is a right ideal. Thus (7) and (8) each assert that | M |= 2, 
that R is an invariant element of S/o, and that every element of S/o other 
than FP is a strict generator. This is equivalent to (6). 

Nole. Many results (like the preceding proposition) are stated in two 
forms (one for transitive, the other for 0-transitive, representations). Here- 
after we shall prove only the primed proposition of each pair. The other 
may be proved by similar methods. 

CoroLLaRY 2.2. A necessary and sufficient condition for S to have a 
faithful transitive representation is that there exist a right congruence o 
on S such that o satisfies (4) and contains no left congruence [ equivalently, 
no 2-sided congruence| except the identity relation and such that there is a 


left identity modulo o. 


COROLLARY 2.2’. A necessary and sufficient condition for S to have a 
faithful 0-transitive representation is that there evrist a right congruence o 
on S such that o satisfies (7) and contains no left congruence [ equivalently, 
no 2-sided congruence| except the identity relation and such that there ts a 


left identity modulo and | 8/o| = 2. 

Proof. Combine 1.1, 1.2 and 2.1’. 

CoroLLARy 2.3 [2.3’|. A sufficient condition for S to have a faithful 
[0-| transitive representation is that S be right [zero| simple and have a 
left identity. (In this case Sg is [0-| transitive and faithful.) 

Proposition 2.4 [2.4’]. A necessary and sufficient condition for a 
commutative semigroup S to have a faithful [0-| transitive representation is 


that S be a group [to which a zero-element has been adjoined]. 


Proof. Given a faithful 0-transitive representation, obtain o by 2.2’. 
By commutativity o is a 2-sided congruence. Hence, o is the identity relation. 
Hence S contains an actual left identity, e, and is right 0-simple with 0- 
element z. Clearly, S — {z} is an &-class containing an idempotent (viz., ¢), 
and hence is a group, so that S is a group with zero. The converse follows 
from 2.3’. 

3. Transitive representations of semigroups with finiteness conditions. 
If FR is a right ideal in 8, Rg denotes the operand in which S acts on FR by 
ordinary right multiplication. 

Proposition 3.1 [8.1’]. Let Mg be a [0-] transitive operand [with 
variant element z|. Let R be a right ideal of S [such that MRSA {2}. 
Then Ms = for some congruence o on 


(| 
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Proof. Let wé€ M be such that uk A {z}. Then w is a strict generator. 
Hence by 1.1 = where is given by: amb if and only if 
Let o be z restricted to R. Define ¢: S/r—> by IP —11N RK, where 1 
is a w-class. Let J={aeS: ua=z} be the (unique) z-class which is a 
right ideal. Then R—JAQM. Hence by 2.1’ R meets every x-class. Hence 
¢ is onto. It is then clear that ¢ is an isomorphism. 

Proposition 3.2 [3.2’]. Let R be a [non-| minimal | principal] righ! 
ideal in S. Let o be a congruence on S [such that all non-generators in I 
are contained in a single o-class 3]. Then Rg/o is a [0-] transitive operan: 


[having & as its (unique) invariant element]. 


Proof. The set J of non-generators in F# is the largest proper invariant 
subset of Rs. Thus Rs/o has no invariant subsets between {} and the whole 
of R/c, since & is the image of J under the natural mapping R—> R/o. Clearly 
~ is an invariant element of R/c, so that R/o is 0-transitive. 


Pro osition 3.3 [3.3’]. Let R and o [and J] be as in 3.2 [8.2’]. 
Then the kernel-congruence x of the operand Rg/o ts given by: axb if and 
only if cacch for all c€ R. In particular Rg/o is faithful if and only if for 
all a,b€ S with ab there exists c€ R with cagcb. 


Proof. Clear. 


THEOREM 3.4. Lel R be a minimal right ideal in 8S. Then the fol- 


lowing are equivalent: 


(9) S has a faithful transitive representation 
(10) ca=cb for all c€ R implies a=b 
(11) ca=cb for all c€ K implies a=b, where K is the union of the 


minimal right ideals of 8. 


Proof. (9)=> (10): Let Wg be faithful and transitive. Then M = Rs/v. 
where o is as in 3.1. Hence (10) follows from the last part of 3.3. 


(10) > (11): Clear, since RC K. 

(11)> (9): Rg is transitive by 3.2 (with othe identity relation). 
Suppose Ps is not faithful. Then there exist a,b € S with a~b and ca =cb 
for all ce R. By semigroup theory K—<SR. Hence for all dé K, there 
exists f€ R, g€ S with da=gfa—gfb—db. This contradicts (11).. 


THEOREM 3.4’. Let S be a semigroup which satisfies the minimum 
condition on principal right ideals. Then S has a faithful 0-transitive repre- 
sentation if and only if there exists an &-class R of S, not a minimal right 


— 
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ideal, such that (for a,b€ S) a=b whenever for all ce R either ca= cb 
or Ca, R. 

Proof. First let Mg be faithful and transitive. By 1.1 W=S/o for 
some right congruence o on S. By 2.1’ one o-class & is a right ideal and 
~~ 8. Let Rk’ be minimal in the class of all principal right ideals not con- 
tained in =. Let R be the ®-class consisting of all generators of 2’. By 3.1’ 
M = R’s/o for some congruence v on R’s. By 2.1’ SN R’AW. Hence if 
R’ were a minimal right ideal } would contain R’, contradicting the definition 
of R’. Moreover if a€(R’ —R) —¥X then & would not contain the right 
ideal aS U {a} contradicting the minimality of Rk’. Thus we can apply 3.3’, 
so that for all a,b€ S with ab there exists c€ Rk’ with cagcl. Hence 
ca cb. Moreover ca and cb are not both in & (since & is contained in a 
single o-class). Thus we have shown that F satisfies the condition stated in 
the theorem. Conversely, suppose such an @-class R exists. Let R’ be the 
principal right ideal generated by R. Define o on R’ by: acb if and only 
if either a—b or a,b ¢ R. By 3.2’ R’s/o is 0-transitive. By 3.3’ R’s/o is 
faithful. (Note that we used the minimum condition only for the proof of 


the necessity of the condition.) 


4, Examples. We call S (right) transitive if S has a faithful transitive 
representation, and left transitive if S is anti-isomorphic to a transitive semi- 
group. It is clear that the results of Sections 2 and 3 concerning transitive 
semigroups can be applied to left transitive semigroups by interchanging 
“left” and “ right.” 

By 2.4 anv commutative semigroup which is not a group is neither left 
nor right transitive. Moreover if S has a right ideal R such that ca=cb 
for all a,b€ S, c€ R then (as an immediate consequence of 3.1 and 3.3) 
S is not transitive. In particular, a semigroup with a zero element is neither 
left nor right transitive. If S is completely simple in the sense of Rees [5| 
with structure matrix P then it is easy to show (by 3.4) that S is transitive 
if and only if P has two rows which are not left multiples of each other. 
3.4 gives a rather explicit condition for any semigroup having a minimal 
right ideal to be transitive. 

We now give examples not having minimal right ideals. 


Example 1. The bicyclic semigroup, which consists of all pairs of non- 


negative integers with multiplication given by: 
(i, j) (k,l) = (i+ b—min(j,k), j + k—min(j,k)), 


is transitive on both sides. The relation o, given by: (1,j)o(k,1) if and only 


1 
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if jl, satisfies 1.3. (co coincides with the relation & in the sense of 
Green [3].) 


Example 2. Let S be a semigroup containing at least two elements, 
T any semigroup, and P the free product (either with or without an identity 

5 
element adjoined) of S and 7. Then P is transitive on both sides. 


Proof. Choose a fixed a€ 8. Let Q be the set of all elements of P 
of one of the following forms: ¢, where t¢€ T; st, where aAs¢ es, t€7: 


n m 


I] I] siti, where m=—2n, T. Now, given any pe P, 
4=1 i=n+1 


write p (uniquely) as a product of elements belonging alternately to S and 
to T. From the left of the sequence thus obtained, delete repeatedly elements 
of Q until this is no longer possible. Call the resulting (possibly empty) 
sequence the normalized form of p. Define a right congruence o on P by: 
poq if and only if p and q have the same normalized forms. Then it is easy 
to see that o satisfies 2. 2. 

In particular, the free semigroup of rank n = 2 is transitive on both sides. 
(Recall that by 2.4 the free semigroup of rank 1 is not transitive.) 


Example 3. Let S= {a}, T={b}. Let P be the free product (with 
or without an identity) of Sand T. Then P is neither left nor right transitive. 


Proof. For i=a or b, let (1,n)€P denote the product in P of n 
elements, alternately a and b, the first one being 7. Suppose P is (say) right 
transitive. Let o be as in 2.2. By (5) the o-class containing a meets the 
right ideal {(a,n):n>1}. Let p be the least integer with ao(a,p) and 
p> 1. Similarly let q be the least integer with bo(b,q) andq>1. Case I: 
one of p, q (say p) is even. Here abo(a,p) so that p—2. Hence ac(a,n) 
for every n. The o-class containing b must meet the right ideal {(a,n): n= 1}. 
Hence aob. Hence P consists of a single o-class, contradicting 2.2. Case IT: 
p and q are both odd. Here it is easy to see that for all 2€ P we have 
(p—1)(q—1)) and yet aA(a,1+ (p—)(q—1)). This 


contradicts the last sentence of 1. 2. 


Example 4. Let M be an infinite set. Let S be the Baer-Levi semi- 
group (given in [1]), which consists of all 1-1 transformations of M into 
M such that | M@—Mr|=|M |. is obviously right transitive. We shall 
show that S is also left transitive. 


Proof. Order M by the first ordinal whose cardinal is | |. Then for 
NCM, |N|=| |, it follows that N (ordered by restriction) has the same 
ordinal as 7. Hence there is a (unique) order isomorphism zy of N onto WV. 


= 
— 


REPRESENTATION OF A SEMIGROUP. 541 


Given 2€ S, rua is a permutation of WM. Define o on S by: ao@ if and only 
if rua == 7ug. It can be shown that o is a left congruence, that each o-class 
meets each left ideal, and that given «,B¢€ S with «8, there exists yc 8 
with ay and By in different o-classes. 


Example 5. The direct product of any number of transitive semigroups 
is transitive. 


Example 6. Let S be a semigroup. Let P = (pi) be an m Xn matrix 
of elements of S. Let T be the Rees matrix semigroup, which consists of all 
nm matrices having an element of S in one place and zeros elsewhere, 
multiplication being defined by Ao B=APB for A,BET. This semigroup 
was considered in [7] and [5] for the case where § is a group. 

Two rows, i and j, of P will be called dependent if, for some a,b€ 8S, 
Pit = pyb for all k. Now if S is right transitive and no two rows of P 
are dependent, 7’ is right transitive. (For, given o on S as in 2.2, identify 
two elements A = (aj;) and of P whenever for all i, 7 either 
= bi or This gives a right congruence having the desired 
properties.) On the other hand, if two rows of P are dependent, then 7 is 
not right transitive. 
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ON DIFFERENTIALS IN FUNCTION FIELDS.* 


By Oscar Zariskt? and PETER FALB. 


Introduction. In order to develop a theory of differentials in algebraic 
functions fields K, of one variable, over imperfect ground fields k, and—above 
all—in order to prove in this case the classical formula which relates the 
order of dx at a place of K/k to the order of the different D, at that place, 
Chevalley, in his book on Algebraic Functions of one variable, defines differ- 
entials by means of repartitions in K/k. The object of this paper is to show 
that also in the case of imperfect ground fields it is possible to develop the 
theory by using the conventional (formal) definition of a differential, without 
having recourse to repartitions. The advantage of our approach is that it 
enables one to deal with differentials dr,dxr.- - -dx, in function fields K//: 
of r independent variables (where repartitions are not available if r >1), by 
identifying (in an obvious sense) such a differential with the differential dv, 


in the field K/k(2x2,: + -,a,) (if the characteristic p of k is different from 
zero, our new ground field k’ =k(a.2,- - -,2,) will not be perfect, even if the 


original ground field / was algebraically closed.). 

Our proofs are essentially based on methods which have been developed 
by one of us in a paper on simple points (see Zariski [1]), and, in particular, 
on the mixed Jacobian criterion for simple points, established in that paper. 
As in that paper, so also here, we fix a model (normal curve) of K/k, we 
observe that any such model is already defined over k?(u,,- - -,up), where 
the u; form a finite set of p-independent elements of k, and we deal with 
derivations with respect to the “ p-parameters” wu; This allows us to define 
the notion of “ uniformizing coérdinates ” (which have been defined previously 
only for absolutely simple points) and to establish a crucial criterion for 
uniformizing codrdinates (see $1, Theorem 1). The notion of the order of a 
differential dr at a place p is defined by using uniformizing codrdinates (sec 
§ 2, Definition 2). The role of Theorem 1 in the proof of the main result 
concerning the different (§3, Theorem 2) is that it allows us to reduce at 
once the proof of Theorem 2 to the case of cyclic Galois extensions, of prime 

* Received May 10, 1961. 


* This research was supported by the Air Force Office of Scientific Research of the 
Air Research and Development Command, under Contract No. AF 49 (638) -494. 
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order. This cyclic case is treated in the last two sections of the paper, and 
here again the use of Theorem 1 is essential.t 

We note that in the case of function fields K/k of r independent variables 
- +,a,, Where we take as new ground field the field &(a2,---,2,) (k- 


algebraically closed), the variables x.,- - -,, play precisely the role of the 
p-parameters U;.° * *,Up. 


1. Uniformizing coordinates. Let V be an irreducible, r-dimensional 
attine variety, immersed in an affine space S, and defined over a ground field & 
of characteristic p40. We assume that the function field K of V over k 
is a separable extension of k. Let +, up be a finite p-independent set 
of elements of / such that the ideal, &(V), of V in k& [Xi,---,X,] has a 
basis - -,fy(X) of polynomials with coefficients in k?(u1,° -,Up) ; 
and let k,; be a maximal subfield of &, containing hk? and such that 
[hy ky] We note that +, up). We will call 
Uy. * *;Up p-parameters of V/k, and k, a field of p-constants associated with 
the p-parameters w,,° * *,Up. Any finite p-independent set U which contains a 
set U, of p-parameters of V/k is also a set of p-parameters of V/k, and if ky 
is a field of p-constants associated with U, then we may take a suitable sub- 
field of ky as field of p-constants associated with U,. This possibility of 
enlarging a set of p-parameters (and correspondingly shrinking the field of 


+ Note added by O. Zariski in galley proofs. During my visit to Frankfurt in June, 
1961, Dr. Ernst Kuntz has called my attention to his most recent paper “ Differential- 
formen inseparabler algebraischen Funktionenkérper ” (Math. Zeitschr., vol. 76 (1961), 
pp. 56-74) in which an essentially similar treatment of differentials over imperfect 
ground fields is found. In many respects the work of Dr. Kuntz goes farther than ours, 
namely (1) he does not assume that K/k is separable and (2) he deals also with 
function fields of varieties of arbitrary dimension (this latter generalization has also 
heen carried out by Dr. Falb in his unpublished Harvard dissertation; in their presen- 
tation of the behaviour of differential forms on complete algebraic models of K/k, both 
Kuntz and Falb follows the procedure given by me in my lecture notes “ Introduction 
to the theory of Algebraic Surfaces,’ Harvard University, 1957-1958). On the other 
hand, our characterization of uniformizing coérdinates (Theorem 1) is not in Kuntz’s 
paper. As this theorem plays an essential role in our paper, our treatment differs from 
that given by Kuntz in a significant way. 

The paper of Kuntz is followed immediately, in the same issue of the Mathematische 
Zeitschrift, by a paper of Hans-Joachim Nastold, entited “ Zum Dualititssatz in in- 
separablen Funktionen = kérpern der Dimension 1” (pp. 75-84). In this paper Dr. 
Nastold, using the work of Kuntz, proves the duality theorem for inseparable function 
fields K/k of dimension 1. Thus, on the basis of these two papers by Kuntz and Nastold, 
we have now, also for inseparable function fields, a complete formulation of the Riemann- 
Roch theorem, in which the differentials (or—equivalently—the canonical divisor class) 
appear in their conventional form (and not as repartitions ) . 
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p-constants) allows us to deal with the case of a finite set of varieties over k; 
i.e., there will then exist elements u,,---,Up and a field &, such that 
{u1," - *,Up} is a set of p-parameters having k, as associated field of p- 
constants, for each variety of the set. For the moment, however, we shall 
assume that the p-parameters -,Wp and the field of p-constants have 
been fixed. As we shall consider derivations of K and of other fields or rings 
containing k, we stipulate now that al’ derivations will be tacitly assumed to 
be trivial on k, (but not necessarily on &). 

Let W be an irreducible, s-dimensional subvariety of V/k, let © be the 
local ring of W on V and let t be the maximal ideal of ©. We shall use the 


following notations: 
D (K) =space of derivations of K (over k,, by our convention) ; 
D(W) =set of derivations D of K which are regular on W, i.e., such 
that DO CO; 
D*(W) = space of local W-derivations D* on V, i.e., derivations D* of 
©, with values in the residue field k(W) of ©; 


D(W) =space of derivations of k(W). 


D(K) and D(W) are vector spaces over K and &(W) respectively, 
D(K) being of dimension r+ p (since K/k is separable and [k: k,] =p’): 
D(W) is an O-module while D*(W) is (canonically) a &(W)-vector space 
({1], p. 34). If Wis simple on V, then dim D*(W) —r-+ p as a consequence 
of the Jacobian criterion for simple subvarieties ([1], p. 39). We have a 
mapping D— D* of D(W) into D*(W) given by D*¥z = Dz, z€ D (where 
a bar above an element of denotes the Mi-residue of that element). Every 
element D of D(W) can be lifted to an element D* of D*(W) by setting 
D*z= Dz, z2€D. The mapping D— D* is clearly an injection. 


DEFINITION 1. elements, -,&psr, of O are uniformizing co- 
ordinates of W if there exist p+ r derivations D; of K, regular on W, such 
that Dé = 85 -,p +71). 

ProposiTIon 1. Jf W is simple on V, then there exist uniformizing 
coordinates of W. 

Proof. Let &(V)-residue of X; and let psn stand for 
Ui,° *,Up, * some order (where -,Up are the p-parameters 
We may then assume that the Jacobian determinant 
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is not zero on W (i.e., is not an element of Mt), since the rank of the Jacobian 
matrix is n—r ([1] p. 39). It follows that 
there exist D,,- - -, in D(K) such that = 8; (4,7 =1,2,° -,p +1) 
and that these derivations are uniquely determined. We have, for 


Oni 


For fixed i, these equations, viewed as equations in the Dinp.,.,, have deter- 
minant which does not belong to Mt. Hence p+). 
But © is a quotient ring of k,[m1,° - -.npsn]; therefore, the D; are regular 
on W. Consequently -,yp.r are uniformizing coordinates of W. 


Coronttary. If W is simple on V, then D(W) is a free O-module of 
rank p+r, and tf &,: + -,&pir are untformizing coordinates of W and 
D,,: + +, Dos are as in Definition 1, then the D; form a free basis of D(W) 
over D. The mapping D> D* of D(W) into D*(W) given by D*z = Dz, 
2€D, ts a surjection, with kernel MD(W). 


Proof. It is clear that the derivations D,,- - -,Dp,, of Definition 1 are 
linearly independent over © and therefore form a basis of D(K) (over K). 
If De D(W), then K, and since = it follows that 
the A; are in D. Hence the D; form a free O-basis of D(W). The second 
assertion follows from the fact that rank D(W)/D =—dim D*(W)/k(W) 


(=p+r). 


Proposition 2. Let be uniformizing coordinates of W and 
J 

let >, be derivations of K, regular on W, such that D’inj = 

Then, given p+r elements -,épu in a necessary and sufficient 


condition that these elements be uniformizing coordinates of W is that the 
determinant | D’;é;| does not belong to M. 


Proof. Assume that the ¢ are uniformizing coordinates of W and let 
D,,- - +, Dow be as in Definition 1. Since the D’; form a basis of D(K) 
over K (actually a free D-base of D(W), by the previous corollary) we have 

ptr 
> AD’; with K (in fact, O), and Ay = Din;. Applying D; to 

j=1 
Sq Wwe find that (Din;) (D’¥_), which shows that || Diy; || - || || is 
j 


the unit matrix. Since the determinant of each of these matrices is in 9, 


each determinant is a unit in ©. 
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Conversely, if | D’ié;| ¢ WM, then the matrix || Dé; || (whose entries are 
in 9, as D(W), and D) has an inverse || vy ||, with vi;E Set 
Then D(W) and Dé; = from which it follows that 

j 


the €; are uniformizing coordinates of W. 

In the proof of our next theorem, which characterizes uniformizing co- 
ordinates, we shall need a lemma concerning the function field of an arbitrary 
irreducible variety in S, (over &) and in which it is not assumed that this 
function field is separable over k (as we later apply this lemma to the 
subvariety W of V). 


Lemma 1. If an wrreducible, s-dimensional variety W/k in 8, admits a 
field k, of p-constants and if [k: k,] =p, then dim D(k(W)/k,) Ss-+ p. 
and any set of generators of k(W) over k, contains a subset of o elements, 
Z1,° *,%0, where o=—dim D(k(W)/k,), such that k(W)/k,(21,° is 
separable algebraic. 


Proof. By assumption, the ideal of W in k[X,,---,X,] has a basis 
{9i(X1,° Up); t=1,2,- --,N} of polynomials with coeffi- 
cients in k?[u,,- - -,up], where the wu; are p-parameters of W/k associated 
with the field k,. Since W/k is simple on S,, we know ({1] p. 39) that the 
mixed Jacobian matrix J(g1,° - -,gv3X1,° *,Up) has rank n—s 
on W (i.e., at a general point of W/k). If we denote by Y,,- - +, ¥nip the 
elements *,Up (in some order) and by their 


2(W)-residues, then we may assume (say) that 


at (Y) = (é). This implies that - -,€psn are separable algebraic over 
ki As k(W) =k, (&,- énp), we have that k(W) is separ- 
able algebraic over k,(&,- - -,€p.,;), and it is well-known ([2], Theorem 41, 


p. 127) that this implies that dim D(k(W)/k,) =s-+ p. 

Let {y,} be a set of generators of k(W)/k, and set L=k(W). Then 
every derivation of L/k, is trivial on k,(Z*), and L is a purely inseparable 
extension of k,(Z*), with [L: k,(L*)] =p’, c=dim D(L/k,). We can 


extract from {y,} a set {y1,° - -,yo} of elements such that 
(L?) (y1,° yo): ky (L”)] =p’, 
and then we can construct o linearly independent derivations D,,- - -, Do of 


L/k, such that Diy; =68; ([2] Corollary 5, p. 125). It is clear that any 


derivation of L over k,(y:,- -,yo) vanishes on LZ. Therefore is a separable 
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algebraic extension of k,(y:,: *-°,yo). This completes the proof of the 
lemma. 


THEOREM 1. Let W be an wrreducible subvariety of V/k and let 
fuy,° * *,Up} and k, be p-parameters and associated field of p-constants for 
V/k. If W ts simple on V/k, then, in order that given elements &,,- + -, Epsr 
of D be uniformizing coordinates of W, it is necessary and sufficient that the 
following two conditions be satisfied: 


(i) k(W) is a separable algebraic extension of ky(&,° where 
=, = M-residue of & (D is the local ring of W on V; WM is the maximal ideal 


of 


(ii) ki [&.° +. contains a set of uniformizing parameters of W 
on V/k. 
Proof. Suppose that é,,- - -,&p., are uniformizing coordinates of W. Let 


D(W) and assume that D =0 on -,Epsr). To establish (i) it 
will suffice to show that D0. Let D* be the local derivation of D obtained 
by lifting D to D*(W). It will suffice to show that D* =0. Let D,,---, Dos 
be as in Definition 1 and let D*,,- - -,D*p,, be the corresponding elements 
of D*(W). Since D*é;—=8);, the D*; are linearly independent over k(W) 
and therefore form a basis of D*(W) over k(W). However, D*é;==0 for 


and It follows imme- 
diately that D* — 0. 
We first prove (ii) under the additional assumption that w,,° - -,up and 


k, are p-parameters and associated field of p-constants not only for V/k but 
also for W/k. 
From (i) it follows that dim D(k, (E,,° +, Eur) = dim D(W) (all 


derivations being assumed zero on k,). Therefore, by Lemma 1, 
dim D (ki +, Spts, 


and thus, by known results on derivations of fields, it is possible to extract 
from a subset of p+ elements, say &,° such that 
is a separable algebraic extension of k, (é,.° (com- 
pare with the proof of the second part of Lemma 1). Let Fy 
(A=1,: - -,r—s) be the minimal polynomial of over + 
F\ may be assumed to be a polynomial in &,,- - -,&p,. (as well as in Y). Let 
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Then 7 € MO k[é&,- - +, &s,], and in order to prove (ii) it will be sufficient 
to show that if there is a relation of the form §& ar € M?. a, € O, then the a, 
must belong to Yt (since the 7, will then be uniformizing parameters of W 
on V, i.e., regular parameters of ©). Now we have D(S\ayr)) € M for any 
D in D(W), and, in particular, for »—1,- - -,r—s, 


Dossen( = + Ay (Dosssutr) € Me. 


Hence (Dosseutr) € Mt. But, by (1), we have = 0 if and 


Dpsssrtrn (since is separable algebraic over h,(&,°- and 

therefore, AY (E1,° Epssen) AO). Thus ay(Dosssatr) € Mt and ay € M. 
In the general case, we proceed as follows: 

Je fix in k, a finite set of p-independent elements tp,.,.° * *,Uo such tha 

We fi k, a finite set of p-ind lent elements u, h that 

the ideal &(1W) has a basis of polynomials with coefficients in k’(u,,- uc). 


Then we fix a maximal subfield k. of k, such that 
< ks, ky and [k,: k.] =p. 


Then it is clear that u,,- - -,Uo are p-parameters, and /, is an associated field 
of p-constants, for both V/k and W/k. We now use the notations of the 
proof of Proposition 1. The coefficients of f1,- -,fn+ are in -,Up), 
and J It follows from that proof, as 
applied to u;,° and kp, that m,° Uo are uniformizing 
coordinates of W relative to Let D’s,- - be the 
corresponding derivations of D (over k.). We have therefore 


Ding = 815, (1,7 = 1, 
and 
4 € 
(2) = 0, (t= 
The relations (2) show that D’;,- - -,D’p,,. are zero on k,. Since both 
and -,ép,, are uniformizing codrdinates of W, over ky, 


we have, by Proposition 2, that the determinant of the derivatives D’jé; 
(t,7=1,2,---,p+r) does not belong Mt. In view of (2), and since 
= it follows that also the (o-+-1)-row determinant of the 
derivatives 


is not in Yt. Therefore, by Proposition 2, Uo are uni- 
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formizing codrdinates of W, over k, Hence, by the preceding special case 


of the proof, the ring - °°, Ue] contains a set of uni- 
formizing parameters of W on V/k. Since +, Uc] the proof 
of (ii) is now complete. 

Conversely, assume that &,- - -,&»,, satisfy conditions (i) and (ii). Fix 
a set {m,° * *ynpsr} of uniformizing coordinates of W (Proposition 1) and 
let D’1,- + -,D’px be as in Proposition 2. It will suffice to show that 
| D’é;| ¢ Mt. Suppose this is false. Then there exist elements a,,- ° -, dps 


of not all in Mt, such that a;(D'é) € WM +7). Let, say, 
a,¢ Mt and set D— aD’, Then De D(W) and Dé€ M, from which it 
follows that DI CM (by (ii)). Hence D—Trw D is defined (by DZ = Dz, 


z€) and is an element of D(W). Since Dé; —0, we have D0 (by (ii)), 
in contradiction with Dyn, =a, 340. This completes the proof of the theorem. 


Corotiary. Jf W is as in the Theorem 1, then there exist uniformizing 
coordinates Of W such that m,° (say) are uniformizing 
parameters of W on V. 


Proof. Let &,° - *,&psr be uniformizing coordinates of W (Proposition 
1) and suppose that k(W) is separable algebraic over k,(&,° (com- 
pare with the second paragraph of the proof of the theorem). Let t,,- - -, ts 


be uniformizing parameters of W on V and set 


It is immediate from the theorem that the »; are uniformizing coordinates of 
Won V. 

We remark that all the results established in this section continue to 
hold (and are well-known) also in the case of characteristic zero. In this 
case. we have k = k?, p=0, and k, =k. 


2. The order of a differential at a prime divisor. From now on we 
shall assume that r—=1. Let p be a prime divisor of K/k and let v= vy be 
the valuation of K/k defined by p. Since for function fields of one variable 
the notion of a prime divisor is equivalent with that of an irreducible, 0- 
dimensional subvariety W of a normal (hence non-singular) model V of K/k, 
we can speak of uniformizing coordinates of ), relative to a given set of p- 
parameters w;,° and an associated field of p-constants. 

Since the space D (K) of derivations of K (over k,) has dimension p + 1, 
we may speak of differentials of K, of degree p + 1, and for any p + 2 elements 
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A,%,° of K, we have a differential Adz,,- - -,dzp,; of K. Further- 
more, if €,,- - -,&p.: are uniformizing coordinates of p, then every (p+ 1)- 
fold differential of K can be written in the form Adé,- - :dépi1, AE K (as 
an immediate consequence of the fact that the derivations D,,- - -,Dp.: of 


Definition 1 form a basis of D(K)). 


DEFINITION 2. If x is a separating transcendental of K/k, tf 


are uniformizing coordinates of p, and if dadu,- dup =Adé,: 
(where the u; are the given p-parameters), then the order of dx at p (in 
symbols: v(dx))\ is the integer v(A). 

Proposition 3. The order of dx at p is well-defined, t.e., v(dx) is 
independent of the choice of the uniformizing coordinates &, of the p-para- 
meters u,, and of the field k, of p-constants. 


Proof. 

(a) That our definition of the order of dx is independent of the choice 
of the (uw, - *,u%p and k, being fixed) follows immediately from Proposi- 
tion 2. 


(b) We next remark that it will suffice to prove the proposition for one 
particular separating transcendental x of K/k, which we shall take as one of 
the generators of K/k. For if 2’ is another separating transcen- 
dental of K/k and if we set dx’du,: - dup = A’dé,- - dé&p,,, then A’ = AD,.’. 
where D, is the (unique) derivation of K over k such that D,7 =1. 


(c) We now demonstrate that for fixed u,,---,up our definition is 
independent of the field &, of p-constants (we recall that &, is any maximal 
subfield of containing k?, such that -,up): ki] and is 
therefore not uniquely determined). We may, by (a), choose as uniformizing 
coordinates a suitable subset 71,° of the set {m,- of &(V)- 
residues of the elements -,Xn, U:,° (arranged in some order). We 
use here the notations of the proof of Proposition 1. Let Ky = k?(m,- °°, psn). 
Then K, depends only on (the generators 7,,°- of K/k 
having been fixed; Ky is in fact the function field of V over k?(u,,- - ‘,Up)). 
Let D,,- -,Dps: be the derivations of K/k, such that Din; = 8), i, 7 = 1. 

‘,;p +1, and let D°,,- - -,D%,, be their restrictions to Ky. The deriva- 
tions D°; of Ky are independent of the choice of k, since they are uniquely 
determined by the conditions Din; = (compare with the proof of Proposi- 
tion 1). Since Ky, and since A Up) 


(where the right hand side denotes the Jacobian determinant relative to the 
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derivations D°,,- --,Dpo.1), our assertion that, for given p-parameters 
U1," * *,Up, the order of dx is independent of the choice of k, is proved. 


(d) We next remark that if w,,---°,wWp are p-parameters such that 
k?(w1,° Wp) = +, Up), then drdw,: - -dwp = cdrdu,: - dup, where 
Up), C0. For if +, 8p are the derivations of up) 
such that 8u;—0 if iAj and §ju;=—1, then c is equal to the Jacobian deter- 
minant 0(w,,° *,Up). It follows that is the same 
for two sets of p-parameters which generate the same field over k?. 


Let {u,,° >,Up} and {w,,: -,wr} be any two sets of p-parameters and 
the sets {u,,- and {w,,- - -,w,} to sets of p-parameters {U;,° *,Uo} 
and {w,,° °,Wo} such that -,uc) =k?(w1,- +,wo). Hence, in 


view of (d) the proof of our proposition will be complete if we show that the 
order of dx is unchanged when the set of p-parameters is enlarged. 

Now let +, Up}, {t1,° Up, Ups,’ Uc}, > p, be two sets of 
p-parameters one of which is contained in the other. Let k’, be a (fixed) 
field of p-constants associated with {u1,: - -,uUc}. Then we can set, by (c), 
hy =k’; Uc), so that k’; C k,. We use the notations of the proof 
of Proposition 1. The coefficients of f,,- +, fn+ are in +, Up), and 
persis’ € Wt. It follows from that proof, applied to 
and k’,, that m,° Uo are uniformizing coordi- 
nates of p relative to k’;. If D%po.1,- - -,D%o are the derivations of 


(15° * *,Uc) 
such that D°aug = dag (% and if 


then 


and our assertion is proved. 


38. The main result. Let K/k be a separably generated function field 
in one variable and let K’ be a finite separable algebraic extension of K. In 
all that follows, we shall tacity assume that a set of p-parameters w1,° - -, Up 
of both K/k and K’/k (and similarly a field k, of p-constants of both K/k 
and K’/k) have been fixed in advance.? Let p be a prime divisor of K/k; 


*T.e., p-parameters of a normal model of AK/k and of a normal model of K’/k. 
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let p’ be a prime divisor of K’/k lying over p; and let x be a separating 
transcendental of K over k (and hence also of K’ over k). We denote by dxX 
(respectively dx*’) the differential dx viewed as a differential of K (respec- 
tively of K’). We shall use the following notations: 


(dx*),, = local p-component of the divisor of i.e., vp(dz™) -p; 
(dxX’),, = local p’-component of the divisor of dr*’, i.e., vp (dxX’) - p’; 
(dx*) = local p’-component of the extension of the divisor (dz*) to K’, 
1.€., €y:pd,(da*)p, where ey-y is the (reduced) ramification index of p’ over p; 
S) = OD, = valuation ring of p; 
=D) = valuation ring of p’; 
= integral closure of © in K’. 
The main object of this paper is the proof of the following (well-known) 
theorem : 


THeEorEM 2. =: (dx*)y + ps where is the 
local p’-component of the divisor of the different Doss of OD over O. 


CoroLtuary. Jf y is a separating transcendental of K/k and if r is a 
prime divisor of K/k such that v,(y) = 0, then v,(dy) = v,(Drxtyi), where 
R is the integral closure of k[y] in K. 

In order to prove the corollary, we need only apply the theorem to the 
two fields &(y) and K and note that the order of dy (viewed as a differential 
of k(y)) at any prime divisor of k(y) which is not a pole of y, is equal to 0 
(since ¥,U:,° ° *,Up are obviously uniformizing coordinates of such a prime 
divisor, where u;,- - -,Up are the fixed p-parameters of K/k). 

The proof of the theorem consists of a series of reductions, followed 


by a direct argument in two cases. In outline, the steps are: 

(1) reduction to the case in which K’/K is a Galois extension; then, in 
the Galois case, 

(2) reduction to the case in which K’/K is a cyclic extension of prime 
order ; 


(3) direct argument in the case of pure ramification 


(4) direct argument in the case of purely inseparable residue field 


extension. 
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The remainder of the paper is divided into four sections which correspond to 
the steps in the proof. 

Before we begin the proof, we remark that if p’ is unramified over p 
(i.e, if @yyy—1 and k(p’) is a separable algebraic extension of k(p)), then 


Theorem 2 is true. In fact, in that case, if &,- - -,ép4. are uniformizing 
coordinates of p, then, by Theorem 1, é,- - -,éps1 are uniformizing coordi- 
nates of p’. Since we have, if drdu, - dup = Adé,: - 1, that 


vy: (dak) = vy (dark) =v, (A) = vy (A) = vp (de®). However, ({2], p. 302, 
Theorem 28), vy (O5/q) =O and our assertion follows. 


4. Reduction to the case in which K’/K is a Galois extension. 


Proposition 4. Let K and K’ be as in Theorem 2 and let K” be a finite 
separable extension of K’. Then, if the theorem is true for any two of the 
extensions K’/K, K”/K, K”’/K’, the theorem is true for the third. 


Proof. Let O’ Oy” (where p” is a prime divisor 
of K’/k lying over p’) and let -, ~ denote integral closure in K’, K” 
respectively. Assume that the theorem holds for the extensions K’/K and 
K”/K. Then, we have: 


== + (D5vo)p 


However, ([2], p. 309, Theorem 31), = and therefore we 
have: 
= (dx*) + + (DSB) v” 
= (dr® + 


We now observe that ©’ is a quotient ring of D with respect to a multiplicative 
set S, and therefore we have D’ =. It follows ({2], p. 299, Theorem 27) 
that (DE,s)p" = (Ds7p)p”- On the other hand, O” is a quotient ring of 
both © and ©’, with respect to the same multiplicative set ([2], p. 47, 
Theorem 18 is used implicitly here). Therefore, we have (D5+/9’)p” 
= p”, and the assertion follows. 

The cases where the theorem holds for some of the other pairs of exten- 
sions are handled in an entirely similar manner and are omitted. 


Proposition 5. If Theorem 2 is valid for Galois extensions K’/K, it 1s 
valid for arbitrary extensions K’/K. 


= 
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Proof. Let K” be the least normal extension of K containing K’. Then 
K”/K and K”/K’ are Galois. Consequently, by Proposition 4 and our 
assumption, Theorem 2 holds for K’/K. 

We shall, therefore, assume from now on that K’ is a Galois extension 


of K. 


5. Reduction to the case in which K’/K is a cyclic extension of prime 
order. Let Ky, Kz be the splitting and inertia fields, respectively, of p’, and 
let pz, pr be the contractions of p’ to Kz and Ky respectively. Then it is 
known that pz is unramified over p ([2], p. 291, proof of Theorem 23) anid 
that pr is unramified over pz ([2], p. 292, Theorem 24). Hence, by the 
remark made at the end of Section 3, our result holds for the extensions 
Kz/K and K7/Kz. In view of Proposition 4, we need only prove the theorem 
for the extension K’/K7, i.e., we may assume from now on that K = Ky. 
The assumption K = Ky leads to the (well-known) conclusion that the Galois 
group of K’/K is solvable. It will then follow directly, from Proposition +4. 
that, if the theorem is true for cyclic Galois extension of prime order, then 
the theorem is true for K’/K. 

We shall assume from now on that K’/K is cyclic, of prime order 4. 
Then g = ep* (e = éy/y), and therefore, we have either g =e and k(p’) = k(p) * 
(case of pure ramification) or e=1, gp, and k(p’) is a purely inseparable 
extension of /(p), of degree p (case of purely inseparable residue field exten- 
sion). Before we present the direct arguments for these two cases, we prove 
a lemma relating ©’ and © (recall that ©’ is the valuation ring of p’ and 
that © is the valuation ring of p). 


LemMMA 2. There eaists an element X in K’ such that OD’ =O[A] in 
either of the two cases, and, in the case of pure ramification, X may be taken 


to be a uniformizing parameter of yp’. . 


Proof. Let be such that k(p’) —k(p)(A) if k(y’) Kk(p); 
if k(p’) = k(p), then let A be a uniformizing parameter of p’. Let O = OA]. 
M—= NM" AD, where MY is the maximal ideal in OD’. Since p’ is the only 
extension of p, D’ is integral over © and therefore also over ©. Furthermore, 
since Mt is the only prime ideal of 5, D is a local domain. We observe 
that O/M = O'/M’ = k(p) (X) and also that © contains a uniformizing 
parameter 7 of p’, where r= A or r= uniformizing parameter of p, depending 
on whether or not ge. Since A¢ K, K’ is the quotient field of ©. We 
assert that D’ Let €€ 0D’. Then + where &€ 5, is our 


Since A = Kr. 


2 


~ 
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uniformizing parameter of p’ in ©, and 7€ 0’, since D/M—=—O//M. It 
follows that, for 7 = 0, there exists a &; € © such that €—é€ MV’. Now K’ 
is the quotient field of © and so, £=a/b with a, b in S. We then have 
a— EM" ND, ie, MAH for all j=0. It follows that 
a€ Db and hence that €€ D. This completes the proof of the lemma. 

We let F(X) € K[X] be the minimal polynomial of A over K. Then, 
({2], p. 303, Theorem 29), Do75 = O’F’(A), and therefore, we must show 
that we have (dx*’),, = (drX)y, + (F’(A))y- in both of our two cases (where 


d is chosen as in Lemma 2). 


6. Direct argument in the case of pure ramification. We assume now 


that and k(p’) =k(p). Let &,- be uniformizing coordinates 
at p, where €, is a uniformizing parameter of » (compare with the corollary 
to Theorem 1). Then, it follows from Theorem 1 that A,é,°- -,&ps1 are 
uniformizing coordinates at p’. We set =A, nj for -,p+1, 
and if D; (i =1,:--,p+1) are derivations of K such that 
(i,u—=1,---,p+1), then we denote their extensions to K’ by D; We 
let E,,- - be the derivations of K’ such that Hinj . We have 
Den = — i= +1, where F?*(X) is obtained from F(A) 


p+1 


by applying D; to the coefficients of F(A). Hence we have Dj = 


where the matrix A = || a, || has the form: 
| __ /F’(A) 0) 
|  [ 
| 
| — 
Here Ip is the p by p identity matrix. It follows that, if drdu,- - - dup 
= Bdé,- - - dép.,, then 


Therefore it will suffice to show that F:(X) is a unit in ©’. 
Let F(A) + -+a,=—0, with Then we have 
— Ag + a,AT* + 444A, which implies that 
(Ag) = Vp (AT? +1 
= 1-4 min(q—1,¢—2 + +, 
Let a) =1 and let aj, be the coefficient such that a;,¢ Mt and a;€ M if j > jo 


(here necessarily 0S jy <q). Then =q—jo. But vy(ag) > 0, and 
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therefore j,=0 and vp(aq) =1. Then ag—eé,, where ¢ is a unit in , and 
we have which implies that Since F(A) 
= (D,a,)At* +- -+ (Diag) and Dya;€ while vp(A) =1, it follows that 
Up: (F?(A)) =0, and the proof for this case is now complete. 


7. Direct argument in the case of purely inseparable residue field 
extension. We assume now that gq =p and that k(p’) is a purely inseparable 
extension of k(p). We let &,- - +,ép4: be uniformizing coordinates of p, 
where we may assume (see Corollary to Theorem 1) that & is a uniformizing 
parameter at p (and hence also at p’). Now, k(p’) is (by our choice of 2) 
a separable algebraic extension of k,(&,,&,° (where k, is our field 
of p-constants). We may assume that w;,- - -,up and k, are p-parameters and 
associated field of p-constants not only for K/k and K’/k but also for p’/k. 
Under this assumption, we have, by Lemma 1, that dim D(k(p’)/k,) Sp. 
and since A¢ k(p), we may assume, by the second part of Lemma 1, that ¢, 


is separable algebraic over -,&.1,A). It follows from Theorem 1 
that if we let nj; 7 =2,° +1, then the are uniformizing 


coordinates of p’. As in Section 6, we let {D;} be the derivations of K/k, 
such that Dig; = 8; we let D, denote the extension of D;, to K’; and we let 
- be the derivations of K’ for which Fin; Arguing as in 
the previous section, we see that it will suffice to show that F:(\) is a unit 
in 9’. 

Let F(A) =A? + aA? -+a,—0, with Since k(p)(X) is 
purely inseparable over k(p), the coefficients a,,---,a@). are in Mt. We 
observe that D,Yt C M and therefore that the trace of D, on k(p) (which 
we denote by D,) is defined.* If D,a,€ M, then D, can be extended to 
k(p)(A), and we denote the extension by D’,. However, D’,7;—0 for 
j=1,---,p+1 and D,é,—1 together contradict the fact that €, is separ- 
able algebraic over k,(71.° It follows that D,a,¢ Mt. Since we have 
FP1(X) = +--+ (Dap) and M if i< p, it follows at 
once that vp: (F'?:(X)) =0 and the proof is now complete also in this case. 
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‘It is given by: Dz = Dyz, 2€ 0. 


POLYHEDRA IN S" WHOSE COMPLEMENTS HAVE UNIFORMLY 
ABELIAN LOCAL FUNDAMENTAL GROUPS.* * 


By C. H. Epwarps, JR. 


1. Introduction. If H and F are homeomorphic closed subsets of the 
n-sphere S”", then a necessary condition that an arbitrary homeomorphism 
carrying onto F will admit an extension to 8” is that the fundamental 
groups 7,(S"—#) and 7,(S"—F) of their complementary domains be 
isomorphic. O. G. Harrold [4] has investigated the problem of imposing 
local conditions on the complements of certain homeomorphic closed sets in 
S" which will suffice to imply that the corresponding fundamental groups 
vanish. He shows in particular that if C is an i-cell in 8” with 8" —C having 
uniformly abelian local fundamental groups, then 7,(8"—C)=—1. A 
generalization of the proof of Harrold is here employed to extend this result 
to a certain class of contractible (n—1)-dimensional curvilinear polyhedra 
imbedded in 8”. 


2. Notation and definitions. The notation and definitions used here 
are the same as those used by Harrold in [4]. In particular, the open set A 
in the space XY will be said to have uniformly abelian local fundamental 
groups provided that for each point p€ X and each neighborhood U of p in 
A, there is a neighborhood V of p such that VCU and such that every closed 
path in V which is homotopic (in U) to the commutator of two paths in U 
is nullhomotopic in U. 


3. 


THEOREM 1. Suppose that C ts a curved polyhedral complex imbedded 
in S", n= 5, such that 


k 
(a) C=UDi, i=1,---,k, being (n—1)-cell subcomplexes of C 
such that IntDsN UD=—O, 
iAj 


j-1 . . 
(b) DiA UD, j=2,---+,k, is an (n—2)-cell complex tamely im- 
bedded in the (n—2)-sphere Bd Dj. 


* New version received January 17, 1961. 
* This paper was supported in part by the National Science Foundation, Contract 
G-8239. 
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If S"—C has uniformly abelian local fundamental groups, then 
m,(S"—C) =1. 

The dimensional restriction n <5 arises from the use of the following 
conjecture, which has been proved in the case n <5 by Guggenheim [3]: 


Conjecture A. If P"-? is a polyhedral (n—2)-cell in 8", then Pr? 
is flat in S"* (there exists a piecewise-linear isotopy of S"* carrying P"* 
onto a standard (nm —2)-simplex). 

If Conjecture A holds for all dimensions, then Theorem 1 holds for all 
dimensions. Observe that the tameness hypothesis in (b) is unnecessary for 
ns 5. 


4. Qutline of proof. Generalizing the construction employed by Harrold 
in [4], a separable metric subspace Y — A U M of the Hilbert cube is defined, 
with A being ULC! and homeomorphic to S"—C, and with M being a 
topological (n—1)-sphere constituting the boundary in X of A. M is then 
clearly LC? with 7,(4/) =1. It is shown also that 7,(A,X) —1, so that a 
theorem of EKilenberg and Wilder [2, Theorem 8]applies directly to show 
that 7,(A) which implies that 7,(S"—C) =1, A and 8"—C being 
homeomorphic. 


5. 


5.1. Preliminaries. By the order of a point a€ C will be meant the 
positive integer p"*(a,C) +1, where p"™*(a,C) is the (n —1)-dimensional 
Betti number of ( around the point a [1,81]. Denote by C® the subset 
k 

of all points of C of order k. It is clear that C@ = |) Int D; is an open (in 
i=1 

(’) dense subset of C. 


sufficiently small that S(a,8,) —C =S(a,8) —D;=U.,U Va, with U, and 
V, separated by D; [1,§2]. It may be assumed that the notation is chosen 


Hence given any point a€ IntD;, 1=jSk, 8&0 can be chosen 


in such a manner that 


U;= U, and V;= V, 
a€IntD; a€Int D; 
are disjoint open connected sets separated in U;U V;U D; by Dj. (This can 
be done as in [5, p. 280]). It will be said that a point x € S"—C sufficiently 
near D; lies on the positive or negative side of D; according as x € U; or x€ V; 
respectively. 


Now let {pim}mz, be a countable dense subset of Int D;. For each m 
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choose 8” > €m > 0 such that S(pim, 8m) —C =U"U V™ with U" C U; and 
Vv" C V; and such that any connected subset of S"—C joining a point in 
U"™ with a point in V"™ AQ n,m) must have a diameter > 2em. 
For each m choose an are Ip = pingim, with Im—pim C S(p'm;ém) —C, in 
such a way that each of the two subsequences 


{ | Gin € U;} and { | V;} 


is dense in Dj. 
k 
The countable dense subset LU {p4m}m2, of C is then ordered in such a 
j-1 


manner as to obtain a sequence {p;}i-1 such that p,€ Int D; if and only if 
i=j(modk). That is, the subsequence of {p;};%, which is contained in 


Int D; is precisely the subsequence { Pjsmx}m=o- 


5.2. Definition of A. Proceeding in this section as in 4.6 of [4], 
define a sequence {f;}:% of real-valued continuous functions on S"—C by 
fi(x) =p(a,qi) for «€ S"—C, where the “relative distance metric” p is 
defined on S"” —C by p(2, y) =g.1.b. of the diameters of all connected subsets 
of 8*—C containing both x and y. 

A homeomorphism h carrying S”—C into Hilbert space is defined as 
follows: If -,%) == S*—C, let h(x) = {2,,- - -, tn, f(x), fo(x), - 
Denote by A the image set h(S"—C). Assume that a metric p» is defined 
in A as in 4.64 of [4]. 


5.3. Sinee S"—C by hypothesis has uniformly abelian local funda- 
mental groups, the techniques used in 4.7 and 4.71 of [4] apply with slight 
modification to show that A is ULC. 


j 
5.4. Definition of M. Let Cj; D; for each so that 
i=1 


(, =D, and C;,=C. For each j=1,- - -,k let the metric p; be defined on 
S"—(C; by p;(x.y) =g.l.b. of the diameters of all connected subsets of 
S" — (; containing both z and y, and define & sequences of real-valued func- 

Suppose now that D, is an (n—1)-cell in C; and let U, and V, be the 
corresponding open sets defined in 5.1, such that U,U V, U Int D, is a neigh- 
horhood of Int D, with U, and V, separated in U,UV,UD, by Dy. If 
w€IntD, and i==1,2,:--,j (modk) then define fjj/(7) =lim for 
and fy’(2) lim fy(2) for V,. Ifiz=j+1,- -,k (modk), then 
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define fi; (x) —fij/’(2) =0. Since two points close to x both lying either 
in U, or V, can be joined by a connected set of small diameter, it is clear 
that the functions f;;’ and fj’ are thereby uniquely defined. 

If «€ Int D,, define 


If y€ Bd Dy, define hj’ (x) =limhj (x) and hj’ (y) =limh;”(z) for Int 


Then hj and h;’, }=1,: - -,k, are mappings of D, into a Hilbert cube (with 
the metric referred to in 5.2). If c€ D,NC™, then C; does not separate 
sufficiently small neighborhoods of x, and it follows that h/(r) =hj’(7). 
But if c€ D, A C, k > 1, then C; does separate sufficiently small neighbor- 
hoods of x, and it follows that hj (2). 


5.41. An elementary continuity argument similar to that used in 4.8 
of [4] suffices to show that hj and h;’ are homeomorphisms on D,, so that 
hj/(D,) and hj’(D,) are (n—1)-cells, 7=1,---,k; s=1,---,). The 
manner in which these homeomorphisms match on the boundaries of two 
intersecting cells D, and D, is described in the following section. 


5.42. Suppose that z is a point of C™ common to the boundaries of 
the (n—1)-cells D, and D, in Cj. If 8>0 is sufficiently small, then 
S(z,8) —C; consists of k mutually separated open sets. Suppose that W is 
one of these sets intersecting both U, and U;. Then it follows that lim hj (7) 
with rz, r€ D,, is equal to limhj(y) with yz, y€ Ds. To see this, 
notice that if «¢€ Int D, and y€ Int D, are sufficiently near z, and if <€ U, 
and y€ U, are sufficiently near 7 and y respectively, then < and ¥ are both 
in W and can be joined in 8"—(C; by a connected set of small diameter. 


Taking limits as i> 7, Y—> y, and r—>z<y gives the desired result. 


j 
5.43. Now define M; = [hj (Di)U hj’ (Di) j= k. Clearly 
i=1 
M = M,, is the boundary in XY = AU M of A. 


5.5. Proof that M is an (n—1)-sphere. It will now be shown that 
M = M;, is an (n—1)-sphere. That M, is an (n—1)-sphere follows as in 
4.8 of [4]. The induction proof will be completed by showing that the 
assumption that M; is an (n—1)-sphere implies that J/;,, is an (n—1)- 
sphere, j=1,- - -,k—1. 

For each € Lj,, = Dj,1, define fi;*(x) = lim for x, FE 
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if i=1,- - -,j (modk) and =0 if =j+1,---,k(modk). Then 
define 

It follows from 5.41 that h;*(Z;,,) is homeomorphic with L;,; and is hence 
an (n— 2)-cell. 

Now triangulate I; in the following natural manner: Recalling that 
each D; is a subcomplex of M;, choose for h/(D;) and hj’(D,) the sub- 
divisions induced by the homeomorphisms hj and hj’ and the original 


j 
subdivision of D;. Since Mj= [hj (Di)U hj’(D;)], this suffices to tri- 
angulate M;. 
Since L;,, is by hypothesis a subcomplex of C, it follows immediately 


that h;*(Z;,,) is a subeomplex of M;, that is, that the (n—2)-cell hj* (Lj.1) 
is polyhedrally imbedded in the (n —1)-sphere Mj. 


5.51. Now define a mapping ¢ of U hj” (Di)] onto M; 
by setting (x) for and (x) for 
€ hj’ (D;). Observe that @ is clearly 1-1 bicontinuous except at points 
Of (Lj) U (Lia). Indeed carries each of the (n—2)-cells 
hia’ and (Lj..) homeomorphically onto the (n — 2)-cell (Lj.1), 
and carries the (n—1)-cells hj..’(D;) and homeomorphically onto 
the (n—1)-cells hj (D;) and hj’(D;) respectively, i=1,---,7. It follows 
as in 5.42 that ¢ is single-valued. 

Since h;*(Z;,,) is a polyhedral (n—2)-cell in the (%—1)-sphere Mj, 
there exists, by Conjecture A, a homeomorphism f of 1/; onto itself carrying 
h;*(Zj,1) onto a standard (n—2)-simplex P. 

Now let # be an (n—1)-simplex with base P and opposite vertex p. 
Let q be an interior point of P and let p’ be an interior point of the interval 
qp. Then denote by H’ the (n —1)-simplex with base P and opposite vertex p’. 
Let g be the mapping defined as follows: 

a) g(x) if Cl(M;—E£); 

b) If «€ P, denote by 2 the point of intersection of the segment xp 
with Bd E’—P. Then let g contract the segment xp linearly onto the seg- 
ment zp. Clearly g gives a homeomorphism of M;—P onto M;— EL’, but 
each point a € Int P has two image points: g’(a) =a€ P and g’(a) € Bd L’ — P. 


A homeomorphism y of [hj.s’(Di)U hjx” (Di) ] onto the closed (n — 1)- 
4=1 


cell Cl(M;—E’) is now defined. If € (Dir) VU hj” (Lj), let p(x) 
=9fo(x). Tf define p(x) and if (Lin) 


11 


| 
| i 
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define = 9’ fo(x). It is immediate that y is a homeomorphism, +o 
that UJ (Di) ] is a closed (n—1)-cell. 
i=1 


5.52. It is shown that hia” (Dj) is a closed (n—1)-cell. 
By 5.41, and (Dj) are both closed (n—1)-cells. If 
«€ Int L;,,U Int Dj,., then and (ax) are distinct, while hj,,’(7) 
if c€ BA D;,,— Int Thus and (Dj) are 
two (n—1)-cells meeting in hj;,,(BdD;,; Int Lj,,). Since by hypothesis 
Lj, is tamely imbedded in the (n—2)-sphere Bd Dj,;, it follows that 
Bd D;,, —Int is a closed (n—2)-cell tamely imbedded in Bd D;j,,. 
Hence h;,,(Bd D;,, —Int L;,,) is a closed (n—2)-cell tame in each of the 
boundaries of the two (n—1)-cells hj..’(Dj) and hj” (Dj). It follows 
easily that hj” (Dji1) is a closed (n —1)-cell. 


+1 
Consequently = U (Di) U (Di)] is the union of the two 
i=l 


closed (n—1)-cells U) (Ds) U (Di) ] and (Dias) 
intersecting in their boundary, the (n—2)-sphere hj.’ (Lj.1) 
U Aja” (Lj). Hence M;,, is an (n—1)-sphere, and it follows by induction 
that M is an (n—1)-sphere. 

6. Since M is an (n—1)-sphere, M is LC’ with 7,(M)=-1. Since 
the proof of 4.9 of [4] applies without alteration to show that 7,(A,¥) =1, 
Theorem 8 of [2] now implies that 7,(A) —1. Since A was constructed 
homeomorphic to 8"—C, it follows that 7,(S"—C)—1. This completes 
the proof of Theorem 1. 
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EXISTENCE OF UNIVERSAL CONNECTIONS.* 


By M. S. NarastmHaANn and S. RAMANAN. 


1. Introduction. The purpose of this paper is to prove the existence of 
universal connections for principal bundles with a compact Lie group as 
structure group. We prove (Theorem 2) that given a compact Lie group G 
and a positive integer n, there exist a differentiable principal G-bundle E and 
a connection y, on FE such that’ any connection on a differentiable principal 
G-bundle P with base of dimensibn = n can be obtained as the inverse image 
of the connection y) by a differentiable bundle homomorphism of P into L£. 
As is well-known, the analogous problem for bundles without connections is 
treated in the topology of fibre bundles [1]. 

It is also known that the Stiefel bundles play the role of universal bundles 
for the unitary groups U(k). One can define in a natural way a connection 
on every Stiefel bundle (§2). We prove that these connections themselves 
are universal for connections in U(k)-bundles. <A precise formulation is 
found in Theorem 1. 

In the unitary case the problem is first solved locally by explicit con- 
struction, the crucial step being the lemma in §3. The local solutions are 
then pieced up with the help of a special type of covering by coordinate cells. 


In the general case, the compact Lie group G is identified with a closed 
subgroup of a unitary group. Starting from a universal connection for this 
unitary group, a universal connection for @ is constructed by generalizing 


the usual method of construction of an invariant connection in the principal 
bundle associated with a Lie group and a closed subgroup ([3], p. 45). 

A theorem of A. Weil ([1], p. 57) asserts that the cohomology classes 
of the base of a principal G-bundle obtained by substitution of the curvature 
form of a connection on P in the invariant polynomials of G are independent 
of the connection. Our result seems to explain this invariance and in fact 
furnishes an alternate proof in the case of compact Lie groups. 

For definitions of the notions related to connections in principal bundles 
we refer to [1] and [3]. We use connections and connection forms inter- 
changeably. By “ differentiable” we always mean “indefinitely differentiable.” 


* Received February 10, 1961. 
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All manifolds, bundles, bundle homomorphisms and differential forms are 
assumed to be differentiable. Also all manifolds that occur are paracompact. 
We are thankful to Professor K. Chandrasekharan for his constant 


encouragement and interest. 


2. Canonical connections in Stiefel bundles. Let C% be the N-dimen- 
sional complex number space with O as origin. The Stiefel manifold V (XN, i) 
(with N =k) of all unitary k-frames at O may then be identified with the 
left coset space U(N)/I, xX U(N—k) where I; is the unit (k,k) matrix. 


N 
To every frame With > where (e;) is the canonical 


base in we associate the (NV,%)-matrix A = (a;;) with a;;—0;;. Since 
(v1, is orthonormal, we have i.e. A satisfies the 
a= 

condition A*A =I; where A* is the conjugate transpose of A. Thus V(N,;k:) 
is identified with matrices A satisfying A*A =J,. The action of U (i) 
(resp. U(N)) on V(N,k) to the right (resp. to the left) goes over under 
the above identification into multiplication of (N,k) matrices by unitary 
(k,k&) matrices on the right (resp. by unitary (N,N) matrices on the left). 
Under the action of U(k), V(N,k) becomes a principal U(k)-bundle (known 
as the Stiefel bundle) with the Grassman manifold G(N,k) of k-subspaces 
of C’ as base. G(N,k) may again be identified with the left coset space 
U(N)/U(k)X U(N—k). 

Let S be the (V,%) matrix-valued function on V(N,k) which associates 
to each frame (2,,: - -,v,) the matrix A. Consider the (k,k/) matrix-valued 
differential form S*dS on V(N,k). Since S*S=—J;, for every frame, on 
differentiation we obtain S*dS + (dS*)S =0, or again S*dS + (S*dS)* =0. 
Hence S*d8S has actually values in the Lie algebra u(k%) (which is the vector 
space of skew-Hermitian matrices) of U(k). 


PRoposiTION 1. S*dS is a connection form on the Stiefel bundle V(X, i) 
which is invariant under the action of U(N). 


In fact, if X¢ is a tangent vector at €€ V(N,k) and s€ U(k), we shall 
denote by X¢s the image of XY¢ under the differential of the map é— és of 
V(N,k). Then we have 


(S*dS) (X¢s) — 8* (Es) (Xes) (8) 
= 
= s1(S*dS) (X¢)s. 
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On the other hand, if a€ u(k), we identify a with a tangent vector at e and 
denote by éa the image of a under the differential of the map s—és of G 


into V(N,k). Then 


(S*dS) (ga) = 8(€)* (éa) (8) 
= 8(£)*8(é)a 
== 
since S(é)*S(é) =J;,. Hence S*d8 is a connection form on the Stiefel bundle. 
Moreover, if ¢€ U(N), the left translation of the differential form by ¢ yields 
(tS)*d(tS) = S*t*tdS — 8*ds. 
Remark. This connection will hereafter be referred to as the canonical 


connection and will be denoted by yo. 

The horizontal subspace for this connection at the point & = oe ) of 
V(N,k) may be described as follows: The tangent space at € can be identified 
with (N,N) skew Hermitian matrices of the type is _— where P is a 
(k,k) skew Hermitian matrix and is a rectangular (N,N —k) matrix. 
The horizontal vectors at € for the connection y, are then given by matrices 


of the type ( 0 ; ). This description together with the invariance under 
the action of U(N) characterises the connection y) completely. 

Analogous statements are true for the real Stiefel manifold W(N,k) 
and the corresponding O(k)-bundle. In particular S’dS (where 8’ is the 
transpose of S) is a connection form on the Stiefel bundle, the corresponding 


/ 
horizontal subspace at é being given by matrices of the type i a , This 


is easily seen to be the orthogonal complement of the vertical subspace at & 
with respect to the killing form tr(adxady) on o(k), the Lie algebra of O(k). 


3. The local problem. 


Lemma. Let U be an open subset of R" and V a relatively compact 
open subset whose closure is contained in U. For every differential form a 
of degree 1 on U with values in u(k) (the space of skew-Hermitian matrices), 
there exist differentiable functions $;,° * +,¢m in V with values in the space 


of complex matrices such that 


1) $)*6; = Ik, and 
j=1 
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m’ 
ii) = = @, 


where m’ = (2n + 1)k?. 


Proof. Let f1,- - -,fx2 be a set of positive definite matrices which form 
a base for the complex Hermitian matrices over the reals, such that || f, || =1 


for every r (|| f || being the norm as a linear transformation). Since «@ has 
n Ke 


values in u(k), we may write in U as where are real- 
g=1r=1 


valued functions and 2, the coordinate functions in R". If a,,,—=sup | d,,s|, 
we have = pr,s — Where 


+ Ar, s 1} and 
1} 


are both strictly positive differentiable functions. Hence we may write 
= ANd Where p,,, and are positive differentiable func- 


tions. Clearly one may assume that }\(u,,s-+ 7,5) is bounded by 1/2k? on 
s=1 
V for every r, by altering the coordinate functions z, by a constant multiple, 
if necessary. The matrix valued function + }f, is then 


positive. For, 
|2 + vee) fr || S 1/2k? || fr || < 


Let g, be the (unique) positive square-root of the positive matrix f, and h, 
the differentiable positive matrix-valued function satisfying 


We now define Mt, (C)-valued functions as follows. 


For 1=jS nk’, ¢; shall be the nk? functions p,,,e'**-g, arranged in 


some order. 

For nk? +15] S 2nk’, 4; shall be the nk? functions q,,,e-## - g, arranged 
in some order. 

For 2nk* +17 5(2n+1)k?, 4; shall be the functions h, in some order. 

We have to verify that the ¢; thus defined fulfil the conditions i) and ii) 
of the lemma. In fact, 
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j= Tr 


== 


On the other hand, 


= 2 Dy, (das) + 
h,dhr 
= + = + = h,dh, 
2 Vr,3) + d( pris? + fr + hrdhr 


But since for any z,y€ V, h,?(x) and h,?(y) commute, their positive square 
roots h,(a) and h,(y) also commute. It readily follows that h,dh,— dh,- hy. 
Hence $d(h,?) =h,dh,. Therefore, finally we have 


== a, 
since r,s) fr + and the lemma is completely proved. 
The problem is solved locally by the following 


PROPOSITION 2. Let P be a principal U(k)-bundle over a manifold X of 
dimension =n and y a connection form on P. For every relatively compact 
open subset W of X with W contained in a coordinate neighborhood U of X 
over which P is trivial, there exists a differentiable bundle map ® of p*(W) 
into V(m”,k) such that the inverse image of the canonical connection +o 
by ® is y, where m” = (2n+1)k® and p is the projection PX. 


Proof. Leto be a section of P over U and @ the inverse image of y by o. 
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By the lemma, we can find differentiable Mt, (C)-valued functions dy 
in W such that 


i) and 
j=1 


ii) where m’ (2n+1)k. 
j=1 


Define a map ® of P over W into the space of (m”,k)-matrices by setting for 
$1 (pé) 

E€P, &(€) = where s€ U(k) is determined by §=—o(pé)s. 
bm’ (pé) 


® is easily seen to be a bundle homomorphism. We then have 


a= §*3, by (i) 
= I, since s is unitary. 


Hence @ maps P|p*(W) actually into V(m”,k). On the other hand, it is 
obvious that the inverse image by ® of yp = S*dS is given by ®*d®. But the 


m’ 
inverse image by o of &*d@ is = > by construc- 
4=1 


tion. Now y and ®*d® are two connections on P|p*(W) such that their 
inverse image by the section o are the same. Hence y—®*d® on p*(W). 


4. Universal connection for the unitary group. 


TuHEoREM 1. Let P be a principal U(k)-bundle over a manifold X of 
dimension =n and y any connection form on P. Then there exists a differ- 
entiable bundle homomorphism & of P into the Stiefel bundle V(m,k) such 
that > is the inverse image by ® of the canonical connection y. on V(m,k), 
where m= (n+ 1) (2n+1)k*. 


Proof. We can find a covering of X by relatively compact open sets 
{V;} such that i) each V; is contained in a coordinate cell, and ii) the V;’s 
can be divided into (n-+1) classes @; in such a way that no two V;’s of 
the same class intersect ([2], p. 61). Let {W;} be a shrinking of this covering, 
i.e., an open covering {W;} such that W,C V;. Let D; (j=1,- -,n +1) 
be the union of the open sets p-*(W;) where W,; C V; with V; belonging to 6). 

The bundle is trivial over the coordinate cells and hence, by Proposition 2, 
one can find differentiable bundle homomorphisms ®; on p-?(V;) into 
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V((2n+1)k',k) inducing the connection y on p*(V;). Corresponding to 
each D; there exists a {(2n-+ 1)k®,k}-matrix-valued differentiable function 
v on P such that & coincides with & on p*(W;) for V; in @;. Let then 
be the (n-+1) functions thus constructed. Consider a differ- 
entiable partition of unity with respect to the covering {D;} consisting of 
non-negative differentiable functions ¢; invariant under the action of the 
group U(k) such that the support of & C Dj and > ¢? —1. 


Consider now the map ® on P defined by 


£:(€) (€) 
= ( every €€ P. 


(€) (€) 


We have to prove that ® is bundle map of P into V(m,k) such that 
But 


the summation being over those 7’s for which €€ D;. But on Dy, ¥F¥,=—T 
and we have (¥,;*%;) =I for every 7 over which the summation extends. 


Hence &*@(£) = =I, since =1. Moreover 
£, (és) (és) £,(€) (€)s 
(és) ( = ( 
(ES) (ES) Ener (E) (€) 8 
£1 (€)¥1 (€) 
=(£)-s 


for every €€ P and s€ U(k). 


Finally, 


n+1 
i= 


n+1 


n+1 


4=1 


As before, for a £€ P, the summation needs to be taken only over those +’s 


| 
or 
S 
4 
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for which D;. In every such Dj, however, and 
Hence = (S67)«—=« since —4d(S —0. 


5. Universal connections for compact Lie groups. Let G, be a closed 
subgroup of a Lie group G;, and g, and g; their Lie algebras. The group G, 
acts on g, by the adjoint operations and g, is invariant under this represen- 
tation. Suppose m is a subspace of g, invariant under the action of G, which 
is supplementary to go. (Such a space m exists if G, is compact or semi- 
simple.) Let P be a principal bundle with group G, and , a connection 
on P. P is fibred by G, into a principal bundle with group G2. 

The direct sum decomposition g,@m of g; gives rise to a projection 
of g, onto g. which commutes with the action of G, (i.e. roads =adsor 
for every s€ G.). We define a differential form we on P by setting w2 = 7 © «. 
It is easy to see that w, is a connection on P for the fibration by G,. In fact, 
(with the notations of § 2), 


w2(Xgs) = (Xgs) 
= 7: ads w,(X¢) 
= ads- mw, (X¢) 
= s*w2(X¢)s 


for every vector X¢ at €€ P and s€ Gs. 
On the other hand, for every €€ P and @€ ge, we have 
(éa) = (éa) 


= 


=a, since 7 is a projection. 


THEOREM 2. Let G be a compact Lie group and n a positive integer. 
There exist a principal G-bundle B and a connection form y, on B such that 
for every principal G-bundle P with base of dimension =n and any con- 
nection form y on P, one can find a bundle homomorphism f of P into B 
such that the inverse image of y1 by f is y. 


Proof. G can be identified with a closed subgroup of a unitary group 
U(k). Let y. be a universal connection for U(k) (for the dimension n) 
on a principal U(k)-bundle B, whose existence has been proved in Theorem 1. 
G acts on B and makes of it a principal G-bundle. One can define a 


EXISTENCE OF UNIVERSAL CONNECTIONS. 571 


connection y, on this bundle by setting yi 2°. where z is a projection of 
u(k) onto the Lie algebra g of G, as explained above. For any principal G- 
bundle P, with base of dimension <n, let P’ be the corresponding principal 
U(k)-bundle obtained by enlarging the group G. Then there is a natural 
inclusion 1: P— P’ such that i(és) =1(€)(s) for €€ P and sé G. 
Moreover, if y is a connection form on P, one can define a natural 
connection y’ on P’ such that the inverse image of y’ by 7 is y ([8], p. 35). 
Let & be a bundle map of P into B such that the inverse image of y. by ® 
is 7’. We define a bundle map f of P into B fibred by G by setting f=. 
The inverse image of y) by f==®-1 is obviously y. But since y has values 
in g and = is identity on g, we have +: yy and hence the inverse image of 


yi by f is y. 
6. Remarks. 


i) We show how A. Weil’s theorem on connections can be deduced from 
our results, at least when G is compact. Let 1, w, be two connections on a 
principal G-bundle P with base X of dimension =n—1. Consider the 
bundle P I’ X XI’ where I’ is the open interval (—«,1+.), «>0. 
Let a, % be inverse images of 1, w. respectively under the projection 
PXI’—>P. The differential form ta,-+ (1—t)a, where ¢ is the pro- 
jection P x I’ I’, is easily seen to be a connection on PX I’. Let B be a 
principal G-bundle over a manifold M and y,; a universal connection on B 
for dimension =n. It follows that there exists a differentiable family F; 
of differentiable bundle maps of P into B such that the inverse image of y, 
by F; is to, + (1—t)wo. If f; are the corresponding maps of X into M, 
then f, and f, are obviously homotopic. On the other hand, if K, and K, 
are the curvature forms of w:, we respectively, the ‘substitution’ of K., K. 
in each polynomial over g invariant under the adjoint representation of G 
yields closed differential forms B. on X. Then and are the inverse 
images under f, and f, of the form on M obtained by substituting the curvature 
form K of the universal connection in the same polynomial. Since f, and f; 
are homotopic, it follows that 6, and B. define the same cohmology class on 
the base, a characteristic class of the bundle. 


ii) Our method gives a universal connection for the orthogonal group 
O(k) in particular. But the connection was defined in the complex Stiefel 
manifold fibred by O(%) instead of the more usual real Stiefel bundle. We 
have already remarked (§2) that if the points of the real Stiefel manifold 
W(N,k) are represented by (NV, %)-matrices A satisfying A’A ];, (A’ being 


ed 
G, 
ch 
on 
T 
1: 
t 
) 
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the transpose of A), a connection can be defined in a canonical way on the 
real Stiefel bundle with the corresponding connection form A’dA. On the 
other hand, the complex Stiefel manifold V(N,k&) may be imbedded into the 
real Stiefel manifold W(2N,k) by associating to each (N,%) matrix A, the 


Rl 
(2N,%) real matrix A = 


parts of A. It is easy to see that if A*A —I,, we have A’A —I;. Moreover, 
for every s€ O(k) C U(k) 


(40) (inn cas) ) 


a) where R1 A, Im A are the real and imaginary 


Hence the map A—A is a bundle map of the complex Stiefel manifold 
V(N,k) fibred by O(k), into the real Stiefel bundle. The: connection form 
on V(N,k) induced by this map is A’dA, but this is the same as the real 
part of A*dA. If we take for the projection z of §4, the map of u(k) onto 
the Lie algebra o(k) of O(k) defined as the assignment of the real part to 
each skew Hermitian matrix, then the corresponding connection y, is the 
same as the real part of A*dA. In other words, the canonical connection in 
the real Stiefel bundle is universal for O(k%)-bundles. 
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